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BRHF1 AS A CANCER DIAGNOSTIC MARKER 



FIELD OF THE INVENTION 

5 This invention relates generally to the discovery of a novel gene, designated 

BHRF1. The invention further relates to the detection of the BHRF1 gene, BHRF1 
expression and/or the presence of the BHRF1 protein as markers for the early diagnosis 
of cancer. 

1 0 SUMMARY OF THE INVENTION 

The present invention provides purified BHRF1 polypeptides and nucleic acids 
encoding these polypeptides. 

Also provided by the present invention are antibodies that specifically bind to 
15 BHRF1 polypeptides. 

Further provided by the present invention is a method for detecting the presence 
of cancer in a subject, comprising: (a) contacting a biological sample obtained from a 
subject with at least two oligonucleotide primers, each primer consisting of 10 to 200 
contiguous nucleotides of SEQ ID NO: 5 or the complement thereof, in a reverse 
20 transcriptase polymerase chain reaction; and (b) detecting in the sample a 

polynucleotide sequence that amplifies in the presence of said oligonucleotide primers, 
wherein the presence of an amplified polynucleotide sequence indicates the presence of 
cancer in the subject. 

Also provided by the present invention is a method for detecting the presence of 
25 cancer in a subject, comprising: (a) contacting a biological sample obtained from a 
subject with a nucleic acid probe under conditions that allow the probe to selectively 
bind a BRHF1 nucleic acid; and (b) detecting the presence of a BRHF1 nucleic acid, 
whereby the presence of a BRHF1 nucleic acid indicates the presence of cancer in the 
subject. 

30 Further provided by the present invention is a method of detecting the presence 

of cancer in a subject comprising: a) contacting a sample from the subject with an 
antibody to a BRHF1 polypeptide; and b) detecting the antibody bound to the BRHF1 
polypeptide in the sample, wherein binding of BRHF1 polypeptide to the antibody 
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indicates the presence of a BRHF1 polypeptide in the sample, the presence of a BRHF1 
polypeptide indicating the presence of cancer in the subject. 

Also provided is a method of reducing BRHF1 expression in a cell comprising 
administering to the cell an antisense oligonucleotide that specifically binds to mRNA 
5 transcribed from the BRHF1 gene under conditions that allow hybridization, wherein 
the BRHF1 mRNA comprises a nucleotide sequence selected from the group consisting 
of SEQ ID NO: 3, SEQ ID NO: 4 and SEQ ID NO: 9 and wherein hybridization of the 
antisense oligonucleotide with the BRHF1 mRNA reduces BRHF1 expression. 

The present invention also provides a method of reducing BRHF1 expression 
1 0 comprising administering to a cell a ribozyme that specifically binds to mRNA 

transcribed from the BRHF1 gene, the ribozyme binding reducing BRHF1 expression. 

Also provided by the present invention is a method of reducing BRHF1 
expression comprising administering to a cell an siRNA that is complementary to at 
least a portion of the coding sequence of BRHF1, under conditions that allow 
1 5 hybridization of the siRNA with the BRHF1 coding sequence, wherein the BRHF1 

coding sequence comprises a nucleotide sequence selected from the group consisting of 
SEQ ID NO: 3 SEQ ID NO: 4 and SEQ ED NO: 9 and wherein the binding of the 
siRNA to the BRHF1 coding sequence reduces BRHF1 expression. 

Further provided by the present invention is a method of identifying a 
20 compound that reduces BRHF1 expression, comprising administering a test compound 
to a cell containing a BRHF1 gene and detecting the level of the BRHFlgene product 
produced, a decrease in the gene product as compared to a control cell indicating the 
compound reduces BRHF1 expression. 

Also provided is a method of identifying a compound that reduces BRHF1 
25 expression, comprising administering a test compound to a cell containing a nucleic 
acid encoding the polypeptide of claim 1 and detecting the level of the BRHFlgene 
product produced, a decrease in the gene product indicating a compound that reduces 
BRHF1 expression. 

The present invention also provides a method of identifying a compound that 
30 reduces BRHF1 expression in the presence of estrogen, comprising administering a test 
compound and estrogen to a cell containing a BRHF1 gene and detecting the level of 
the BRHFlgene product produced, a decrease in the gene product as compared to a 
control cell indicating a compound that reduces BRHF1 expression in the presence of 
estrogen. 
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Also provided is a method of identifying a compound that reduces BRHF1 
expression in the presence of estrogen, comprising administering a test compound and 
estrogen to a cell containing a nucleic acid encoding the polypeptide of claim 1 and 
detecting the level of the BRHFlgene product produced, a decrease in the gene product 
5 indicating a compound that reduces BRHF1 expression in the presence of estrogen. 

DESCRIPTION OF THE FIGURES 

Figure 1 shows the AP-PCR fingerprints of matched uninvolved breast tissue 
10 (N) and tumor (T) tissue DNA using OPC04 (5'CCGCATCTAC3') (SEQ ID NO: 14) 
primer. The amplification product was resolved on denaturing urea-sequencing gel and 
detected by silver staining. The two arrows on the right side of the photograph indicates 
the two single strand of 1270 bp amplification product amplified in tumors. 

15 Figure 2 illustrates chromosomes in metaphase showing the chromosomal 

localization of BRHF1 gene on 17ql 1.2 (Left panel). Two ideograms illustrating the 
chromosomal position of BRHF1 gene at 17 ql 1.2 are also provided. (Right panel). 

Figure 3 A is a Northern blot analysis of the BRHF-1 gene demonstrating a 
20 transcript size of 6.5 kb (indicated by arrow). 

Figure 3B shows the that the expected 6.5 kb BRHF1 gene transcript was 
detected by Northern blot only from RNA of MCF7(ER+ve) cells, and not from MDA- 
MB 468(ER-ve) cells. 

25 

Figure 4 is an agarose gel photograph of the BRHF1 gene transcript detected by 
RT-PCR analysis. Multifold higher level of BRHF1 gene transcript (indicated by 
arrow) was observed in tumors (T) as compared to matched normal samples. Equal 
intensity of P-actin gene amplified confirmed the same amount of RNA taken from N 
30 and T for cDNA synthesis. 

Figure 5 is a RT-PCR analysis of the BRHF1 gene transcript with the RNA 
isolated from three batches (ET1, ET2, ET3) of MCF-7cells treated with 100 pg/ml of 
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17 p-estradiol, and their corresponding untreated controls (Ci, C2, C3). Higher levels of 
BRHF1 gene expression (indicated by an arrow) were observed in estrogen-treated 
MCF7 cells. 

5 Figure 6 is an ORF region encoding 1275 amino acids representing a part of a 

novel gene of BRHF-1 gene-family and also located on human chromosome Xp22. The 
insert DNA sequence (SEQ ID NO: 5) of BRHF1 of the present invention is 
underlined; it contains FG repeats and nuclear localization signals KKKR; and shows 
significant similarity with neucleoporin FG repeat containing family proteins. Amino 
10 acid sequences in italics represents a region showing significant similarity with a 

conserved domain of an Exo endo_phos family protein, and amino acid sequences in 
bold represent a conserved domain of the reverse transcriptase family protein. 

DETAILED DESCRIPTION OF THE INVENTION 

15 Using Arbitrary-Primed PCR (AP-PCR), mutations were found in over 80% of 

breast tumor samples analyzed as compared with the uninvolved breast tissue from the 
same subject. Further characterization by cloning and sequencing of this genomic 
region harboring mutation revealed that it represents a part of a novel gene named 
BHRF1. The mutation present in breast tumor samples is a 1270 bp insertion located in 

20 chromosomal region 17qll.2. Therefore, the present invention provides methods of 
detecting the genomic region harboring this mutation as well as methods of detecting 
BHRF1 transcripts and fragments thereof. Also provided are methods of detecting a 
BHRF1 polypeptide. These methods can be utilized to detect, stage and treat cancer in 
a subject. 

25 The present invention may be understood more readily by reference to the 

following detailed description of preferred embodiments of the invention and the 
Examples included therein and to the Figures and their previous and following 
description. 

Before the present compounds, compositions, articles, devices, and/or methods 
30 are disclosed and described, it is to be understood that this invention is not limited to 
specific nucleic acids, specific polypeptides, or to particular methods, as such may, of 
course, vary. It is also to be understood that the terminology used herein is for the 
purpose of describing particular embodiments only and is not intended to be limiting. 
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As used in the specification and the appended claims, the singular forms "a," 
"an" and "the" include plural referents unless the context clearly dictates otherwise. 
Thus, for example, reference to "an antibody" includes mixtures of antibodies, 
reference to "a pharmaceutical carrier" includes mixtures of two or more such carriers, 
5 and the like. 

Ranges may be expressed herein as from "about" one particular value, and/or to 
"about" another particular value. When such a range is expressed, another embodiment 
includes from the one particular value and/or to the other particular value. Similarly, 
when values are expressed as approximations, by use of the antecedent "about," it will 
10 be understood that the particular value forms another embodiment. It will be further 

understood that the endpoints of each of the ranges are significant both in relation to the 
other endpoint, and independently of the other endpoint. 

"Optional" or "optionally" means that the subsequently described event or 
circumstance may or may not occur, and that the description includes instances where 
1 5 said event or circumstance occurs and instances where it does not. For example, the 
phrase "optionally obtained prior to treatment" means obtained before treatment, after 
treatment, or not at all. 

As used throughout, by "subject" is meant an individual. Preferably, the subject 
is a mammal such as a primate, and, more preferably, a human. The term "subject" 
20 includes domesticated animals, such as cats, dogs, etc., livestock (e.g., cattle, horses, 
pigs, sheep, goats, etc.), and laboratory animals (e.g., mouse, rabbit, rat, gerbil, guinea 
pig, etc.). 

BRHF1 Polypeptides 

25 The present invention provides an isolated polypeptide comprising SEQ ID NO: 

1. SEQ ID NO: 1 corresponds to amino acid residues 1-105 of a full-length BHRF1 
polypeptide. The present invention also provides an isolated polypeptide comprising 
SEQ ID NO: 2 (amino acid residues 106-375 of a BHRF1 polypeptide) and an isolated 
polypeptide comprising SEQ ID NO: 8 (amino acid residues 1-375 of a BHRF1 

30 polypeptide) The present invention also provides fragments of SEQ ID NO: 1, SEQ ID 
NO: 2 and SEQ ID NO: 8. Optionally, the fragments comprise at least 10, 20, 30, 40, 
50, 60, 70, 80, 90, 100, 1 10, 120, 130, 140 or any number or amino acids less than the 
full length BHRF1 polypeptide. Such fragments include one or more conservative 
amino acid substitutions as compared to the sequence of SEQ ID NO: 8. Optionally, 
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the fragments have at least 75%, 80%, 85%, 90%, 95% 99% identity or any percentage 
in between as compared to the corresponding position of SEQ ID NO: 8. 

By "isolated polypeptide" or "purified polypeptide" is meant a polypeptide that 
is substantially free from the materials with which the polypeptide is normally 
5 associated in nature or in culture. The polypeptides of the invention can be obtained, 
for example, by extraction from a natural source if available (for example, a 
mammalian cell), by expression of a recombinant nucleic acid encoding the polypeptide 
(for example, in a cell or in a cell-free translation system), or by chemically 
synthesizing the polypeptide. In addition, polypeptide may be obtained by cleaving full 

1 0 length polypeptides. When the polypeptide is a fragment of a larger naturally occurring 
polypeptide, the isolated polypeptide is shorter than and excludes the full-length, 
naturally-occurring polypeptide of which it is a fragment. 

The polypeptides of the invention can be prepared using any of a number of 
chemical polypeptide synthesis techniques well known to those of ordinary skill in the 

15 art including solution methods and solid phase methods. One method of producing the 
polypeptides of the present invention is to link two or more peptides or polypeptides 
together by protein chemistry techniques. For example, peptides or polypeptides can be 
chemically synthesized using currently available laboratory equipment using either 
Fmoc (9-fluorenylmethyloxycarbonyl) or Boc (tert -butyloxycarbonoyl) chemistry. 

20 (Applied Biosystems, Inc., Foster City, CA). One skilled in the art can readily 

appreciate that a peptide or polypeptide corresponding to the antibody of the present 
invention, for example, can be synthesized by standard chemical reactions. For 
example, a peptide or polypeptide can be synthesized and not cleaved from its synthesis 
resin, whereas the other fragment of an antibody can be synthesized and subsequently 

25 cleaved from the resin, thereby exposing a terminal group which is functionally 

blocked on the other fragment. By peptide condensation reactions, these two fragments 
can be covalently joined via a peptide bond at their carboxyl and amino termini, 
respectively, to form an antibody, or fragment thereof. (Grant GA (1992) Synthetic 
Peptides: A User Guide. W.H. Freeman and Co., N.Y. (1992); Bodansky M and Trost 

30 B., Ed. (1993) Principles of Peptide Synthesis. Springer- Verlag Inc., NY. 

Alternatively, the peptide or polypeptide is independently synthesized in vivo as 
described above. Once isolated, these independent peptides or polypeptides may be 
linked to form an antibody or fragment thereof via similar peptide condensation 
reactions. 
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For example, enzymatic ligation of cloned or synthetic peptide segments allow 
relatively short peptide fragments to be joined to produce larger peptide fragments, 
polypeptides or whole protein domains (Abrahmsen L et al., Biochemistry, 30:4151 

(1991) ). Alternatively, native chemical ligation of synthetic peptides can be utilized to 
5 synthetically construct large peptides or polypeptides from shorter peptide fragments. 

This method consists of a two-step chemical reaction (Dawson et al. Synthesis of 
Proteins by Native Chemical Ligation. Science, 266:776-779 (1994)). The first step is 
the chemoselective reaction of an unprotected synthetic peptide-alpha-thioester with 
another unprotected peptide segment containing an amino-terminal Cys residue to give 

10 a thioester-linked intermediate as the initial covalent product. Without a change in the 
reaction conditions, this intermediate undergoes spontaneous, rapid intramolecular 
reaction to form a native peptide bond at the ligation site. Application of this native 
chemical ligation method to the total synthesis of a protein molecule is illustrated by 
the preparation of human interleukin 8 (IL-8) (Baggiolini M et al. (1992) FEBS Lett. 

15 307:97-101; Clark-Lewis I et al., J.Biol.Chem., 269:16075 (1994); Clark-Lewis I et al., 
Biochemistry, 30:3128 (1991); Rajarathnam K et al., Biochemistry 33:6623-30 (1994)). 

Alternatively, unprotected peptide segments are chemically linked where the 
bond formed between the peptide segments as a result of the chemical ligation is an 
unnatural (non-peptide) bond (Schnolzer, M et al. Science, 256:221 (1992)). This 

20 technique has been used to synthesize analogs of protein domains as well as large 

amounts of relatively pure proteins with full biological activity (deLisle Milton RC et 
al., Techniques in Protein Chemistry IV. Academic Press, New York, pp. 257-267 

(1992) ). 

The polypeptides of the invention can also be prepared by other means 
25 including, for example, recombinant techniques. Examples of appropriate cloning and 
sequencing techniques, and instructions sufficient to direct persons of skill through 
many cloning exercises are found in Sambrook et al. (2001) Molecular Cloning - A 
Laboratory Manual (3rd ed.) Vol. 1-3, Cold Spring Harbor Laboratory, Cold Spring 
Harbor Press, NY, (Sambrook). 
30 Also provided by the present invention is a polypeptide comprising an amino 

acid sequence at least about 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% 
identical to a sequence selected from the group consisting of: SEQ ID NO: 1, SEQ ID 
NO: 2 and SEQ ID NO: 8. 
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It is understood that as discussed herein the use of the terms "homology" and 
"identity" mean the same thing as similarity. Thus, for example, if the use of the word 
homology is used to refer to two non-natural sequences, it is understood that this is not 
necessarily indicating an evolutionary relationship between these two sequences, but 
5 rather is looking at the similarity or relatedness between their nucleic acid sequences. 
Many of the methods for determining homology between two evolutionarily related 
molecules are routinely applied to any two or more nucleic acids or proteins for the 
purpose of measuring sequence similarity regardless of whether they are evolutionarily 
related. 

10 In general, it is understood that one way to define any known variants and 

derivatives or those that might arise, of the disclosed nucleic acids and polypeptides 
herein, is through defining the variants and derivatives in terms of homology to specific 
known sequences. In general, variants of nucleic acids and polypeptides herein 
disclosed typically have at least, about 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 

15 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, or 99 percent homology to 
the stated sequence or the native sequence. Those of skill in the art readily understand 
how to determine the homology of two polypeptides or nucleic acids. For example, the 
homology can be calculated after aligning the two sequences so that the homology is at 
its highest level. 

20 Another way of calculating homology can be performed by published 

algorithms. Optimal alignment of sequences for comparison may be conducted by the 
local homology algorithm of Smith and Waterman Adv. Appl. Math. 2: 482 (1981), by 
the homology alignment algorithm of Needleman and Wunsch, J. Mol. Biol. 48: 443 
(1970), by the search for similarity method of Pearson and Lipman, Proc. Natl. Acad. 

25 Sci. U.S.A. 85: 2444 (1988), by computerized implementations of these algorithms 

(GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, 
Genetics Computer Group, 575 Science Dr., Madison, WI; the BLAST algorithm of 
Tatusova and Madden FEMS Microbiol. Lett. 174: 247-250 (1999) available from the 
National Center for Biotechnology Information 

30 (http://www.ncbi.nlm.nih.gov/blast/bl2seq/bl2.html) ), or by inspection. 

The same types of homology can be obtained for nucleic acids by for example 
the algorithms disclosed in Zuker, M. Science 244:48-52, 1989, Jaeger et al. Proc. Natl. 
Acad. Sci. USA 86:7706-7710, 1989, Jaeger et al. Methods Enzymol. 183:281-306, 1989 
which are herein incorporated by reference for at least material related to nucleic acid 
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alignment. It is understood that any of the methods typically can he used and that in 
certain instances the results of these various methods may differ, but the skilled artisan 
understands if identity is found with at least one of these methods, the sequences would 
be said to have the stated identity. 
5 For example, as used herein, a sequence recited as having a particular percent 

homology to another sequence refers to sequences that have the recited homology as 
calculated by any one or more of the calculation methods described above. For 
example, a first sequence has 80 percent homology, as defined herein, to a second 
sequence if the first sequence is calculated to have 80 percent homology to the second 

10 sequence using the Zuker calculation method even if the first sequence does not have 
80 percent homology to the second sequence as calculated by any of the other 
calculation methods. As another example, a first sequence has 80 percent homology, as 
defined herein, to a second sequence if the first sequence is calculated to have 80 
percent homology to the second sequence using both the Zuker calculation method and 

15 the Pearson and Lipman calculation method even if the first sequence does not have 80 
percent homology to the second sequence as calculated by the Smith and Waterman 
calculation method, the Needleman and Wunsch calculation method, the Jaeger 
calculation methods, or any of the other calculation methods. As yet another example, 
a first sequence has 80 percent homology, as defined herein, to a second sequence if the 

20 first sequence is calculated to have 80 percent homology to the second sequence using 
each of calculation methods (although, in practice, the different calculation methods 
will often result in different calculated homology percentages). 

Also provided by the present invention is a polypeptide comprising an amino 
acid sequence selected from the group consisting of: SEQ ID NO: 1, SEQ ID NO: 2 

25 and SEQ ID NO: 8 with one or more conservative amino acid substitutions. These 

conservative substitutions are such that a naturally occurring amino acid is replaced by 
one having similar properties. Such conservative substitutions do not alter the function 
of the polypeptide. For example, conservative substitutions can be made according to 
the following table: 

30 
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TABLE 1 : 


Amino Acid Substitutions 


Original Residue 


Exemplary Substitutions 


Arg 


Lys 


Asn 


Gin 


Asp 

r 


Glu 


Cys 


Ser 


Gin 


Asn 


Glu 


Asp 


Gly 


Pro 


His 


Gin 


lie 


leu; val 


Leu 


ile; val 


Lys 


arg; gin 


Met 


leu; ile 


Phe 


met; leu; tyr 


Ser 


Thr 


Thr 


Ser 


Trp 


Tyr 


Tyr 


trp; phe 


Val 


ile; leu 



Thus, it is understood that, where desired, modifications and changes may be 
made in the nucleic acid encoding the polypeptides of this invention and/or amino acid 
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sequence of the polypeptides of the present invention and still obtain a polypeptide 
having like or otherwise desirable characteristics. Such changes may occur in natural 
isolates or may be synthetically introduced using site-specific mutagenesis, the 
procedures for which, such as mis-match polymerase chain reaction (PCR), are well 
5 known in the art. For example, certain amino acids may be substituted for other amino 
acids in a polypeptide without appreciable loss of functional activity. It is thus 
contemplated that various changes may be made in the amino acid sequence of the 
polypeptides of the present invention (or underlying nucleic acid sequence) without 
appreciable loss of biological utility or activity and possibly with an increase in such 

10 utility or activity. 

Further provided by the invention are antigenic fragments of SEQ ID NO: 1, 
SEQ ID NO: 2 and SEQ ID NO: 8. It is also understood that a BHRF1 gene product, 
such as the BHRF1 polypeptides comprising SEQ ID NO: 1, SEQ ID NO: 2 or SEQ ID 
NO: 8, can exert a biological function associated with the BHRF1 gene. An active 

1 5 BHRF1 gene product may act as a transcriptional activator to elicit downstream effects. 
The BHRF1 gene product may also function from outside of the cell either as a ligand 
which can bind to its cell surface receptor and participate in signal transduction, or it 
may function through the interaction with other secreted proteins and extracellular 
matrix molecules outside of cells. Any effect associated with BHRF1 associated 

20 cancers is also contemplated by this invention. 

Nucleic Acids 

The present invention also provides nucleic acids that encode BHRF1 
polypeptides. The present invention also provides a nucleic acid encoding a 

25 polypeptide comprising a sequence selected from the group consisting of: SEQ ID NO: 
1, SEQ ID NO: 2 and SEQ ID NO: 8. The present invention also provides a nucleic 
acid sequence comprising a nucleic acid sequence corresponding to a genomic insertion 
(SEQ ID NO: 5) of approximately 1270 bp found in the mutated BHRF1 gene 
associated with cancer. Also provided are nucleic acid sequences that encode 

30 fragments of BHRF1 polypeptides as disclosed herein. 

An example of a nucleic acid encoding a polypeptide comprising SEQ ID NO: 
1, is provided herein as SEQ ID NO: 3. An example of a nucleic acid encoding a 
polypeptide comprising SEQ ID NO: 2, is provided herein as SEQ ID NO: 4. An 
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example of a nucleic acid encoding a polypeptide comprising SEQ ID NO: 8 is 
provided herein as SEQ ID NO: 9. 

As used herein, the term "nucleic acid" refers to single or multiple stranded 
molecules which may be DNA or RNA, or any combination thereof, including 
5 modifications to those nucleic acids. The nucleic acid may represent a coding strand or 
its complement, or any combination thereof. Nucleic acids may be identical in 
sequence to the sequences which are naturally occurring for any of the moieties 
discussed herein or may include alternative codons which encode the same amino acid 
as that which is found in the naturally occurring sequence. These nucleic acids can also 

10 be modified from their typical structure. Such modifications include, but are not 

limited to, methylated nucleic acids, the substitution of a non-bridging oxygen on the 
phosphate residue with either a sulfur (yielding phosphorothioate deoxynucleotides), 
selenium (yielding phosphorselenoate deoxynucleotides), or methyl groups (yielding 
methylphosphonate deoxynucleotides), a reduction in the AT content of AT rich 

15 regions, or replacement of non-preferred codon usage of the expression system to 
preferred codon usage of the expression system. 

The nucleic acid can be directly cloned into an appropriate vector, or if desired, 
can be modified to facilitate the subsequent cloning steps. Such modification steps are 
routine, an example of which is the addition of oligonucleotide linkers which contain 

20 restriction sites to the termini of the nucleic acid. General methods are set forth in 
Sambrook et ah, "Molecular Cloning, a Laboratory Manual," Cold Spring Harbor 
Laboratory Press (1989). 

Once the nucleic acid sequence is obtained, the sequence encoding the specific 
amino acids can be modified or changed at any particular amino acid position by 

25 techniques well known in the art. For example, PCR primers can be designed which 
span the amino acid position or positions and which can substitute any amino acid for 
another amino acid. Alternatively, one skilled in the art can introduce specific 
mutations at any point in a particular nucleic acid sequence through techniques for 
point mutagenesis. General methods are set forth in Smith, M. "In vitro mutagenesis" 

30 Ann. Rev. Gen., 19:423-462 (1985) and Zoller, M.J. "New molecular biology methods 
for protein engineering" Curr. Opin. Struct. Biol., 1:605-610 (1991), which are 
incorporated herein in their entirety for the methods. These techniques can be used to 
alter the coding sequence without altering the amino acid sequence that is encoded. 
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Vectors, Cells, and Methods of Using 

Also provided is a vector, comprising a nucleic acid of the present invention. 
The vector can direct the in vivo or in vitro synthesis of any of the polypeptides 
described herein. The vector is contemplated to have the necessary functional elements 
5 that direct and regulate transcription of the inserted nucleic acid. These functional 

elements include, but are not limited to, a promoter, regions upstream or downstream of 
the promoter, such as enhancers that may regulate the transcriptional activity of the 
promoter, an origin of replication, appropriate restriction sites to facilitate cloning of 
inserts adjacent to the promoter, antibiotic resistance genes or other markers which can 

10 serve to select for cells containing the vector or the vector containing the insert, RNA 
splice junctions, a transcription termination region, or any other region which may 
serve to facilitate the expression of the inserted gene or hybrid gene. (See generally, 
Sambrook et al). The vector, for example, can be a plasmid. The vectors can contain 
genes conferring hygromycin resistance, gentamicin resistance, or other genes or 

15 phenotypes suitable for use as selectable markers, or methotrexate resistance for gene 
amplification. The vector can comprise the nucleic acid in pET15b, pSRa-Neo, 
pPICZa, or pPIC9K. 

There are numerous other E. coli (Escherichia coli) expression vectors, known 
to one of ordinary skill in the art, which are useful for the expression of the nucleic acid 

20 insert. Other microbial hosts suitable for use include bacilli, such as Bacillus subtilis, 
and other enterobacteriaceae, such as Salmonella, Serratia, and various Pseudomonas 
species. In these prokaryotic hosts one can also make expression vectors, which will 
typically contain expression control sequences compatible with the host cell (e.g., an 
origin of replication). In addition, any number of a variety of well-known promoters 

25 will be present, such as the lactose promoter system, a tryptophan (Trp) promoter 

system, a beta- lactamase promoter system, or a promoter system from phage lambda. 
The promoters will typically control expression, optionally with an operator sequence, 
and have ribosome binding site sequences for example, for initiating and completing 
transcription and translation. If necessary, an amino terminal methionine can be 

30 provided by insertion of a Met codon 5' and in- frame with the downstream nucleic acid 
insert. Also, the carboxy-terminal extension of the nucleic acid insert can be removed 
using standard oligonucleotide mutagenesis procedures. Also, nucleic acid 
modifications can be made to promote amino terminal homogeneity. 
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Additionally, yeast expression can be used. The invention provides a nucleic 
acid encoding a polypeptide of the present invention, wherein the nucleic acid can be 
expressed by a yeast cell. More specifically, the nucleic acid can be expressed by 
Pichia pastoris or S. cerevisiae. There are several advantages to yeast expression 
5 systems, which include, for example, Saccharomyces cerevisiae and Pichia pastoris. 
First, evidence exists that proteins produced in a yeast secretion systems exhibit correct 
disulfide pairing. Second, efficient large scale production can be carried out using 
yeast expression systems. The Saccharomyces cerevisiae pre-pro-alpha mating factor 
leader region (encoded by the MFa-1 gene) can be used to direct protein secretion from 

10 yeast (Brake, et al.). The leader region of pre-pro-alpha mating factor contains a signal 
peptide and a pro-segment which includes a recognition sequence for a yeast protease 
encoded by the KEX2 gene: this enzyme cleaves the precursor protein on the carboxyl 
side of a Lys-Arg dipeptide cleavage signal sequence. The nucleic acid coding 
sequence can be fused in- frame to the pre-pro-alpha mating factor leader region. This 

15 construct can be put under the control of a strong transcription promoter, such as the 
alcohol dehydrogenase I promoter, alcohol oxidase I promoter, a glycolytic promoter, 
or a promoter for the galactose utilization pathway. The nucleic acid coding sequence 
is followed by a translation termination codon which is followed by transcription 
termination signals. Alternatively, the nucleic acid coding sequences can be fused to a 

20 second protein coding sequence, such as Sj26 or beta-galactosidase, used to facilitate 
purification of the fusion protein by affinity chromatography. The insertion of protease 
cleavage sites to separate the components of the fusion protein is applicable to 
constructs used for expression in yeast. Efficient post translational glycosylation and 
expression of recombinant proteins can also be achieved in Baculovirus systems. 

25 Mammalian cells permit the expression of proteins in an environment that 

favors important post-translational modifications such as folding and cysteine pairing, 
addition of complex carbohydrate structures, and secretion of active protein. Vectors 
useful for the expression of active proteins in mammalian cells are characterized by 
insertion of the protein coding sequence between a strong viral promoter and a 

30 polyadenylation signal. The vectors can contain genes conferring hygromycin 

resistance, genticin or G4 1 8 resistance, or other genes or phenotypes suitable for use as 
selectable markers, or methotrexate resistance for gene amplification. The chimeric 
protein coding sequence can be introduced into a Chinese hamster ovary (CHO) cell 
line using a methotrexate resistance-encoding vector, or other cell lines using suitable 
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selection markers. Presence of the vector DNA in transformed cells can be confirmed 
by Southern blot analysis. Production of RNA corresponding to the insert coding 
sequence can be confirmed by Northern blot analysis. A number of other suitable host 
cell lines capable of secreting intact human proteins have been developed in the art, and 
5 include the CHO cell lines, HeLa cells, myeloma cell lines, Jurkat cells, etc. 

Expression vectors for these cells can include expression control sequences, such as an 
origin of replication, a promoter, an enhancer, and necessary information processing 
sites, such as ribosome binding sites, RNA splice sites, polyadenylation sites, and 
transcriptional terminator sequences. Preferred expression control sequences are 

10 promoters derived from immunoglobulin genes, SV40, Adenovirus, Bovine Papilloma 
Virus, etc. The vectors containing the nucleic acid segments of interest can be 
transferred into the host cell by well-known methods, which vary depending on the type 
of cellular host. For example, calcium chloride transformation is commonly utilized for 
prokaryotic cells, whereas calcium phosphate, DEAE dextran, or lipofectin mediated 

15 transfection or electroporation may be used for other eukaryotic cellular hosts. 

Alternative vectors for the expression of genes or nucleic acids in mammalian 
cells, those similar to those developed for the expression of human gamma-interferon, 
tissue plasminogen activator, clotting Factor VIII, hepatitis B virus surface antigen, 
protease Nexinl, and eosinophil major basic protein, can be employed. Further, the 

20 vector can include CMV promoter sequences and a polyadenylation signal available for 
expression of inserted nucleic acids in mammalian cells (such as COS-7). 

Insect cells also permit the expression of mammalian proteins. Recombinant 
proteins produced in insect cells with baculovirus vectors undergo post-translational 
modifications similar to that of wild-type proteins. Briefly, baculovirus vectors useful 

25 for the expression of active proteins in insect cells are characterized by insertion of the 
protein coding sequence downstream of the Autographica californica nuclear 
polyhedrosis virus (AcNPV) promoter for the gene encoding polyhedrin, the major 
occlusion protein. Cultured insect cells such as Spodoptera frugiperda cell lines are 
transfected with a mixture of viral and plasmid DNAs and the viral progeny are plated. 

30 Deletion or insertional inactivation of the polyhedrin gene results in the production of 
occlusion negative viruses which form plaques that are distinctively different from 
those of wild- type occlusion positive viruses. These distinctive plaque morphologies 
allow visual screening for recombinant viruses in which the AcNPV gene has been 
replaced with a hybrid gene of choice. 
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The invention also provides for the vectors containing the contemplated nucleic 
acids in a host suitable for expressing the nucleic acids. The host cell can be a 
prokaryotic cell, including, for example, a bacterial cell. More particularly, the 
bacterial cell can be an E. coli cell. Alternatively, the cell can be a eukaryotic cell, 
5 including, for example, a Chinese hamster ovary (CHO) cell, a myeloma cell, a Pichia 
cell, or an insect cell. The coding sequence for any of the polypeptides described 
herein can be introduced into a Chinese hamster ovary (CHO) cell line, for example, 
using a methotrexate resistance-encoding vector, or other cell lines using suitable 
selection markers. Presence of the vector DNA in transformed cells can be confirmed 

10 by Southern blot analysis. Production of RNA corresponding to the insert coding 

sequence can be confirmed by Northern blot analysis. A number of other suitable host 
cell lines have been developed and include myeloma cell lines, fibroblast cell lines, and 
a variety of tumor cell lines such as melanoma cell lines. Expression vectors for these 
cells can include expression control sequences, such as an origin of replication, a 

1 5 promoter, an enhancer, and necessary information processing sites, such as ribosome 
binding sites, RNA splice sites, polyadenylation sites, and transcriptional terminator 
sequences. Preferred expression control sequences are promoters derived from 
immunoglobulin genes, SV40, Adenovirus, Bovine Papilloma Virus, etc. The vectors 
containing the nucleic acid segments of interest can be transferred into the host cell by 

20 well-known methods, which vary depending on the type of cellular host. For example, 
calcium chloride transformation is commonly utilized for prokaryotic cells, whereas 
calcium phosphate, DEAE dextran, or lipofectin mediated transfection or 
electroporation may be used for other cellular hosts. 

The present invention provides a method of making any of the polypeptides 

25 derived from the BHRF 1 polypeptide described herein comprising: culturing a host 
cell comprising a vector that encodes a BHRF 1 polypeptide and purifying the 
polypeptide produced by the host cell. As mentioned above, these polypeptides 
include, but are not limited to, a polypeptide comprising SEQ ED NO: 1, SEQ ED NO: 2 
or SEQ ID NO: 8. Such polypeptides can also include variants of SEQ ED NO: 1, SEQ 

30 ED NO: 2 or SEQ ED NO: 8 or fragments thereof as described herein. 



Antibodies 

The present invention further provides an isolated antibody or fragment thereof 
that specifically binds an epitope contained within amino acids 1-105 of the BHRF1 
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polypeptide. In other words, the present invention provides an isolated antibody or 
fragment thereof that specifically binds an epitope contained within the amino acid 
sequence of SEQ ID NO: 1 . Further provided by the present invention is an isolated 
antibody or fragment thereof that specifically binds an epitope contained within amino 
5 acids 106-375 of the BHRF1 polypeptide (SEQ ID NO: 2). The present invention also 
provides an antibody that specifically binds to an epitope contained within or 
overlapping with amino acids 1-105 of the BHRF1 polypeptide, as evidenced by 
competitive binding studies. The present invention also provides an antibody that 
specifically binds to an epitope contained within or overlapping with amino acids 106- 

10 375 of the BHRF1 polypeptide, as evidenced by competitive binding studies. 

Competitive binding studies are well known in the art. For example, if a test antibody 
competes for binding to the BFIRF1 polypeptide with an antibody or ligand that 
specifically binds an epitope contained within amino acids 1-105 of the BHRF1 
polypeptide, one of skill in the art would readily know that the test antibody binds the 

1 5 same epitope as the antibody or ligand that specifically binds to an epitope contained 
within or overlapping with amino acids 1-105 of the BHRF1 polypeptide. Similarly, if 
a test antibody competes for binding to the BHRF1 polypeptide with an antibody or 
ligand that specifically binds an epitope contained within amino acids 106-375 of the 
BHRF1 polypeptide, one of skill in the art would readily know that the test antibody 

20 binds the same epitope as the antibody or ligand that specifically binds to an epitope 
contained within or overlapping with amino acids 106-375 of the BHRF1 polypeptide. 

The antibody of the present invention can be a polyclonal antibody or a 
monoclonal antibody. The antibody of the invention selectively binds a BHRF1 
polypeptide. By "selectively binds" or "specifically binds" is meant an antibody 

25 binding reaction which is determinative of the presence of the antigen (in the present 
case, a BHRF1 polypeptide or antigenic fragments thereof) among a heterogeneous 
population of proteins and other biologies. Thus, under designated immunoassay 
conditions, the specified antibodies bind preferentially to a particular peptide and do 
not bind in a significant amount to other proteins in the sample. Specific binding to a 

30 BHRF1 polypeptide under such conditions requires an antibody that is selected for its 
specificity to a BHRF1 polypeptide. Preferably, selective binding includes binding at 
about or above 1.5 times assay background and the absence of significant binding is 
less than 1 .5 times assay background. 
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This invention also contemplates antibodies that compete for binding to natural 
BHRF1 interactors. For example, an antibody of the present invention can compete 
with BHRFT for a binding site (e.g. a receptor) on a cell or the antibody can compete 
with BHRF1 for binding to another protein or biological molecule. The antibody 
5 optionally can have either an antagonistic or agonists function as compared to the 
antigen. Preferably, the antibody binds a BHRF1 polypeptide ex vivo or in vivo. 
Optionally, the antibody of the invention is labeled with a detectable moiety. For 
example, the detectable moiety can be selected from the group consisting of a 
fluorescent moiety, an enzyme-linked moiety, a biotin moiety and a radiolabeled 

10 moiety. The antibody can be used in techniques or procedures such as diagnostics, 
screening, or imaging. Anti-idiotypic antibodies and affinity matured antibodies are 
also considered to be part of the invention. 

As used herein, the term "antibody" encompasses, but is not limited to, whole 
immunoglobulin (i.e., an intact antibody) of any class. Native antibodies are usually 

15 heterotetrameric glycoproteins, composed of two identical light (L) chains and two 
identical heavy (H) chains. Typically, each light chain is linked to a heavy chain by 
one covalent disulfide bond, while the number of disulfide linkages varies between the 
heavy chains of different immunoglobulin isotypes. Each heavy and light chain also 
has regularly spaced intrachain disulfide bridges. Each heavy chain has at one end a 

20 variable domain (V(H)) followed by a number of constant domains. Each light chain 
has a variable domain at one end (V(L)) and a constant domain at its other end; the 
constant domain of the light chain is aligned with the first constant domain of the heavy 
chain, and the light chain variable domain is aligned with the variable domain of the 
heavy chain. Particular amino acid residues are believed to form an interface between 

25 the light and heavy chain variable domains. The light chains of antibodies from any 
vertebrate species can be assigned to one of two clearly distinct types, called kappa (k) 
and lambda (1), based on the amino acid sequences of their constant domains. 
Depending on the amino acid sequence of the constant domain of their heavy chains, 
immunoglobulins can be assigned to different classes. There are five major classes of 

30 human immunoglobulins: IgA, IgD, IgE, IgG and IgM, and several of these may be 
further divided into subclasses (isotypes), e.g., IgG-1, IgG-2, IgG-3, and IgG-4; IgA-1 
and IgA-2. One skilled in the art would recognize the comparable classes for mouse or 
other species. The heavy chain constant domains that correspond to the different classes 
of immunoglobulins are called alpha, delta, epsilon, gamma, and mu, respectively. 
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As used herein, the terms "immunoglobulin heavy chain or fragments thereof 
and "immunoglobulin light chain or fragments thereof encompass chimeric peptides 
and hybrid peptides, with dual or multiple antigen or epitope specificities, and 
fragments, including hybrid fragments. Thus, fragments of the heavy chains and/or 
5 fragments of the light chains that retain the ability to bind their specific antigens are 
provided. For example, fragments of the heavy chains and/or fragments of the light 
chains that maintain BHRF1 protein binding activity are included within the meaning 
of the terms "immunoglobulin heavy chain or fragments thereof and "immunoglobulin 
light chain and fragments thereof," respectively. Such heavy chains and light chains 

10 and fragments thereof, respectively, can be made by techniques known in the art and 
can be screened for specificity and activity according to the methods set forth in the 
Examples and in general methods for producing antibodies and screening antibodies for 
specificity and activity (See Harlow and Lane. Antibodies, A Laboratory Manual. Cold 
Spring Harbor Publications, New York, (1988)). 

1 5 The term "variable" is used herein to describe certain portions of the variable 

domains that differ in sequence among antibodies and are used in the binding and 
specificity of each particular antibody for its particular antigen. However, the 
variability is not usually evenly distributed through the variable domains of antibodies. 
It is typically concentrated in three segments called complementarity determining 

20 regions (CDRs) or hypervariable regions both in the light chain and the heavy chain 
variable domains. The more highly conserved portions of the variable domains are 
called the framework (FR). The variable domains of native heavy and light chains each 
comprise four FR regions, largely adopting a P-sheet configuration, connected by three 
CDRs, which form loops connecting, and in some cases forming part of, the P-sheet 

25 structure. The CDRs in each chain are held together in close proximity by the FR 
regions and, with the CDRs from the other chain, contribute to the formation of the 
antigen binding site of antibodies (see Kabat E. A. et al., "Sequences of Proteins of 
Immunological Interest," National Institutes of Health, Bethesda, Md. (1987)). The 
constant domains are not involved directly in binding an antibody to an antigen, but 

30 exhibit various effector functions, such as participation of the antibody in antibody- 
dependent cellular toxicity. 

As used herein, the term "antibody or fragments thereof encompasses chimeric 
antibodies and hybrid antibodies, with dual or multiple antigen or epitope specificities, 
and fragments, such as F(ab')2, Fab', Fab and the like, including hybrid fragments. 
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Thus, fragments of the antibodies that retain the ability to bind their specific antigens 
are provided. For example, fragments of antibodies which maintain BHRF1 protein 
binding activity are included within the meaning of the term "antibody or fragment 
thereof." Such antibodies and fragments can be made by techniques known in the art 
5 and can be screened for specificity and activity according to the methods set forth in the 
Examples and in general methods for producing antibodies and screening antibodies for 
specificity and activity (See Harlow and Lane. Antibodies, A Laboratory Manual. Cold 
Spring Harbor Publications, New York, (1988)). 

Also included within the meaning of "antibody or fragments thereof are 

10 conjugates of antibody fragments and antigen binding proteins (single chain antibodies) 
as described, for example, in U.S. Pat. No. 4,704,692, the contents of which are hereby 
incorporated by reference. 

Optionally, the antibodies are generated in other species and "humanized" for 
administration in humans. In one embodiment of the invention, the "humanized" 

15 antibody is a human version of the antibody produced by a germ line mutant animal. 
Humanized forms of non-human (e.g., murine) antibodies are chimeric 
immunoglobulins, immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab', 
F(ab')2, or other antigen-binding subsequences of antibodies) which contain minimal 
sequence derived from non-human immunoglobulin. Humanized antibodies include 

20 human immunoglobulins (recipient antibody) in which residues from a CDR of the 
recipient are replaced by residues from a CDR of a non-human species (donor 
antibody) such as mouse, rat or rabbit having the desired specificity, affinity and 
capacity. In one embodiment, the present invention provides a humanized version of 
an antibody, comprising at least one, two, three, four, or up to all CDRs of a BHRF1 

25 monoclonal antibody. In some instances, Fv framework residues of the human 
immunoglobulin are replaced by corresponding non-human residues. Humanized 
antibodies may also comprise residues that are found neither in the recipient antibody 
nor in the imported CDR or framework sequences. In general, the humanized antibody 
will comprise substantially all of or at least one, and typically two, variable domains, in 

30 which all or substantially all of the CDR regions correspond to those of a non-human 
immunoglobulin and all or substantially all of the FR regions are those of a human 
immunoglobulin consensus sequence. The humanized antibody optimally also will 
comprise at least a portion of an immunoglobulin constant region (Fc), typically that of 
a human immunoglobulin (Jones et al., Nature, 321:522-525 (1986); Riechmann et al., 
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Nature, 332:323-327 (1988); and Presta, Curr. Op. Struct. Biol., 2:593-596 (1992)). 

Methods for humanizing non-human antibodies are well known in the art. 
Generally, a humanized antibody has one or more amino acid residues introduced into 
it from a source that is non-human. These non-human amino acid residues are often 
5 referred to as "import" residues, which are typically taken from an "import" variable 
domain. Humanization can be essentially performed following the method of Winter 
and co-workers (Jones et al., Nature, 321:522-525 (1986); Riechmann et al., Nature, 
332:323-327 (1988); Verhoeyen et al., Science, 239:1534-1536 (1988)), by substituting 
rodent CDRs or CDR sequences for the corresponding sequences of a human antibody. 

10 Accordingly, such "humanized" antibodies are chimeric antibodies (U.S. Pat. No. 

4,816,567), wherein substantially less than an intact human variable domain has been 
substituted by the corresponding sequence from a non-human species. In practice, 
humanized antibodies are typically human antibodies in which some CDR residues and 
possibly some FR residues are substituted by residues from analogous sites in rodent 

1 5 antibodies. 

The choice of human variable domains, both light and heavy, to be used in 
making the humanized antibodies is very important in order to reduce antigenicity. 
According to the "best-fit" method, the sequence of the variable domain of a rodent 
antibody is screened against the entire library of known human variable domain 

20 sequences. The human sequence which is closest to that of the rodent is then accepted 
as the human framework (FR) for the humanized antibody (Sims et al., J. Immunol., 
151:2296 (1993) and Chothia et al., J. Mol. Biol., 196:901 (1987)). Another method 
uses a particular framework derived from the consensus sequence of all human 
antibodies of a particular subgroup of light or heavy chains. The same framework may 

25 be used for several different humanized antibodies (Carter et al., Proc. Natl. Acad. Sci. 
USA, 89:4285 (1992); Presta et al., J. Immunol., 151:2623 (1993)). 

It is important that antibodies be humanized with retention of high affinity for 
the antigen and other favorable biological properties. To achieve this goal, according to 
one method, humanized antibodies are prepared by a process of analysis of the parental 

30 sequences and various conceptual humanized products using three dimensional models 
of the parental and humanized sequences. Three dimensional immunoglobulin models 
are commonly available and are familiar to those skilled in the art. Computer programs 
are available which illustrate and display probable three-dimensional conformational 
structures of selected candidate immunoglobulin sequences. Inspection of these 
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displays permits analysis of the likely role of the residues in the functioning of the 
candidate immunoglobulin sequence, i.e., the analysis of residues that influence the 
ability of the candidate immunoglobulin to bind its antigen. In this way, FR residues 
can be selected and combined from the consensus and import sequence so that the 
5 desired antibody characteristic, such as increased affinity for the target antigen(s), is 
achieved. In general, the CDR residues are directly and most substantially involved in 
influencing antigen binding (see, WO 94/04679, published 3 March 1994). 

Transgenic animals (e.g., mice) that are capable, upon immunization, of 
producing a full repertoire of human antibodies in the absence of endogenous 

10 immunoglobulin production can be employed. For example, it has been described that 
the homozygous deletion of the antibody heavy chain joining region (J(H)) gene in 
chimeric and germ-line mutant mice results in complete inhibition of endogenous 
antibody production. Transfer of the human germ-line immunoglobulin gene array in 
such germ-line mutant mice will result in the production of human antibodies upon 

15 antigen challenge (see, e.g., Jakobovits et al., Proc. Natl. Acad. Sci. USA, 90:2551-255 
(1993); Jakobovits et al., Nature, 362:255-258 (1993); Bruggemann et al., Year in 
Immuno., 7:33 (1993)). Human antibodies can also be produced in phage display 
libraries (Hoogenboom et al., J. Mol. Biol., 227:381 (1991); Marks et al., J. Mol. Biol., 
222:581 (1991)). The techniques of Cote et al. and Boerner et al. are also available for 

20 the preparation of human monoclonal antibodies (Cole et al., Monoclonal Antibodies 
and Cancer Therapy, Alan R. Liss, p. 77 (1985); Boerner et al., J. Immunol., 147(1):86- 
95 (1991)). 

The present invention further provides a hybridoma cell that produces the 
monoclonal antibody of the invention. An example of such a hybridoma cell is a 

25 hybridoma cell which produces a monoclonal antibody that specifically binds an 

epitope contained within amino acids 1-105 of the BHRF1 polypeptide. The present 
invention further provides a hybridoma cell which produces a monoclonal antibody that 
specifically binds an epitope contained within amino acids 106-375 of the BHRF1 
polypeptide. Also provided by the present invention is a hybridoma cell which 

30 produces a monoclonal antibody that specifically binds an epitope contained within 
amino acids 1-375 of the BHRF1 polypeptide. 

The term "monoclonal antibody" as used herein refers to an antibody obtained 
from a substantially homogeneous population of antibodies, i.e., the individual 
antibodies comprising the population are identical except for possible naturally 
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occurring mutations that may be present in minor amounts. The monoclonal antibodies 
herein specifically include "chimeric" antibodies in which a portion of the heavy and/or 
light chain is identical with or homologous to corresponding sequences in antibodies 
derived from a particular species or belonging to a particular antibody class or subclass, 
5 while the remainder of the chain(s) is identical with or homologous to corresponding 
sequences in antibodies derived from another species or belonging to another antibody 
class or subclass, as well as fragments of such antibodies, so long as they exhibit the 
desired activity (See, U.S. Pat. No. 4,816,567 and Morrison et al., Proc. Natl. Acad. 
Sci. USA, 81:6851-6855 (1984)). 

10 Monoclonal antibodies of the invention may be prepared using hybridoma 

methods, such as those described by Kohler and Milstein, Nature, 256:495 (1975) or 
Harlow and Lane. Antibodies, A Laboratory Manual. Cold Spring Harbor Publications, 
New York, (1988). In a hybridoma method, a mouse or other appropriate host animal, 
is typically immunized with an immunizing agent to elicit lymphocytes that produce or 

15 are capable of producing antibodies that will specifically bind to the immunizing agent. 
Alternatively, the lymphocytes may be immunized in vitro. Preferably, the immunizing 
agent comprises a polypeptide of the present invention. Thus, the antibodies of the 
present invention specifically bind BHRF1 protein. 

Traditionally, the generation of monoclonal antibodies has depended on the 

20 availability of purified protein or peptides for use as the immunogen. More recently 
DNA based immunizations have shown promise as a way to elicit strong immune 
responses and generate monoclonal antibodies. In this approach, DNA-based 
immunization can be used, wherein DNA encoding a portion of BHRE1 protein, for 
example, expressed as a fusion protein with human IgGl is injected into the host 

25 animal according to methods known in the art (e.g., Kilpatrick KE, et al. Gene gun 

delivered DNA-based immunizations mediate rapid production of murine monoclonal 
antibodies to the Flt-3 receptor. Hybridoma. 1998 Dec;17(6):569-76; Kilpatrick KE et 
al. High-affinity monoclonal antibodies to PED/PEA-15 generated using 5 microg of 
DNA. Hybridoma. 2000 Aug;19(4):297-302, which are incorporated herein by 

30 reference in full for the the methods of antibody production). 

An alternate approach to immunizations with either purified protein or DNA is 
to use antigen expressed in baculovirus. The advantages to this system include ease of 
generation, high levels of expression, and post-translational modifications that are 
highly similar to those seen in mammalian systems. Use of this system involves 
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expressing, for example, domains of BHRF1 protein antibody as fusion proteins. The 
antigen is produced by inserting a gene fragment in-frame between the signal sequence 
and the mature protein domain, for example, of the BHRF1 protein antibody nucleotide 
sequence. This results in the display of the foreign proteins on the surface of the virion. 
5 This method allows immunization with whole virus, eliminating the need for 
purification of target antigens. 

Generally, either peripheral blood lymphocytes ("PBLs") are used in methods 
of producing monoclonal antibodies if cells of human origin are desired, or spleen cells 
or lymph node cells are used if non-human mammalian sources are desired. The 

10 lymphocytes aire then fused with an immortalized cell line using a suitable fusing agent, 
such as polyethylene glycol, to form a hybridoma cell (Goding, "Monoclonal 
Antibodies: Principles and Practice" Academic Press, (1986) pp. 59-103). 
Immortalized cell lines are usually transformed mammalian cells, including myeloma 
cells of rodent, bovine, equine, and human origin. Usually, rat or mouse myeloma cell 

15 lines are employed. The hybridoma cells may be cultured in a suitable culture medium 
that preferably contains one or more substances that inhibit the growth or survival of 
the unfused, immortalized cells. For example, if the parental cells lack the enzyme 
hypoxanthine guanine phosphoribosyl transferase (HGPRT or HPRT), the culture 
medium for the hybridomas typically will include hypoxanthine, aminopterin, and 

20 thymidine ("HAT medium"), which substances prevent the growth of HGPRT-deficient 
cells. Preferred immortalized cell lines are those that fuse efficiently, support stable 
high level expression of antibody by the selected antibody-producing cells, and are 
sensitive to a medium such as HAT medium. More preferred immortalized cell lines 
are murine myeloma lines, which can be obtained, for instance, from the Salk Institute 

25 Cell Distribution Center, San Diego, Calif, and the American Type Culture Collection, 
Rockville, Md. Human myeloma and mouse-human heteromyeloma cell lines also 
have been described for the production of human monoclonal antibodies (Kozbor, J. 
Immunol., 133:3001 (1984); Brodeur et al., "Monoclonal Antibody Production 
Techniques and Applications" Marcel Dekker, Inc., New York, (1987) pp. 51-63). The 

30 culture medium in which the hybridoma cells are cultured can then be assayed for the 
presence of monoclonal antibodies directed against BHRF1 protein. Preferably, the 
binding specificity of monoclonal antibodies produced by the hybridoma cells is 
determined by immunoprecipitation or by an in vitro binding assay, such as 
radioimmunoassay (RIA) or enzyme-linked immunoabsorbent assay (ELISA). Such 
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techniques and assays are known in the art, and are described further in the Examples 
below or in Harlow and Lane "Antibodies, A Laboratory Manual" Cold Spring Harbor 
Publications, New York, (1988). 

After the desired hybridoma cells are identified, the clones may be subcloned by 
5 limiting dilution or FACS sorting procedures and grown by standard methods. Suitable 
culture media for this purpose include, for example, Dulbecco's Modified Eagle's 
Medium and RPMI-1640 medium. Alternatively, the hybridoma cells may be grown in 
vivo as ascites in a mammal. 

The monoclonal antibodies secreted by the subclones may be isolated or 
10 purified from the culture medium or ascites fluid by conventional immunoglobulin 
purification procedures such as, for example, protein A-Sepharose, protein G, 
hydroxylapatite chromatography, gel electrophoresis, dialysis, or affinity 
chromatography. 

The monoclonal antibodies may also be made by recombinant DNA methods, 

15 such as those described in U.S. Pat. No. 4,816,567. DNA encoding the monoclonal 
antibodies of the invention can be readily isolated and sequenced using conventional 
procedures (e.g., by using oligonucleotide probes that are capable of binding 
specifically to genes encoding the heavy and light chains of murine antibodies). The 
hybridoma cells of the invention serve as a preferred source of such DNA. Once 

20 isolated, the DNA may be placed into expression vectors, which are then transfected 
into host cells such as simian COS cells, Chinese hamster ovary (CHO) cells, 
plasmacytoma cells, or myeloma cells that do not otherwise produce immunoglobulin 
protein, to obtain the synthesis of monoclonal antibodies in the recombinant host cells. 
The DNA also may be modified, for example, by substituting the coding sequence for 

25 human heavy and light chain constant domains in place of the homologous murine 
sequences (U.S. Pat. No. 4,816,567) or by covalently joining to the immunoglobulin 
coding sequence all or part of the coding sequence for a non-immunoglobulin 
polypeptide. Optionally, such a non-immunoglobulin polypeptide is substituted for the 
constant domains of an antibody of the invention or substituted for the variable 

30 domains of one antigen-combining site of an antibody of the invention to create a 

chimeric bivalent antibody comprising one antigen-combining site having specificity 
for BHRF1 polypeptide and another antigen-combining site having specificity for a 
different antigen. 
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The present invention further provides an isolated nucleic acid, comprising a 
nucleotide sequence that encodes an immunoglobulin heavy chain or fragment thereof 
of an antibody or fragment thereof of an antibody that specifically binds an epitope 
contained within amino acids 1-105 of the BHRF1 protein (SEQ ID NO: 1), an isolated 
5 nucleic acid, comprising a nucleotide sequence that encodes an immunoglobulin heavy 
chain or fragment thereof of an antibody or fragment thereof of an antibody that 
specifically binds an epitope contained within amino acids 106-375 of the BHRF1 
protein (SEQ ID NO: 2) and an isolated nucleic acid, comprising a nucleotide sequence 
that encodes an immunoglobulin heavy chain or fragment thereof of an antibody or 

10 fragment thereof of an antibody that specifically binds an epitope contained within 
amino acids 1-375 of the BHRF1 protein (SEQ ID NO: 8). 

Vectors comprising any of the nucleic acids comprising a nucleotide sequence 
that encodes an immunoglobulin heavy chain or a fragment thereof are also 
contemplated by this invention as are host cells comprising such vectors. Suitable 

15 vectors and host cells for expression of the immunoglobulin heavy chains of the present 
invention are described above. 

The present invention further provides an isolated nucleic acid, comprising a 
nucleotide sequence that encodes an immunoglobulin light chain or fragment thereof of 
an antibody or fragment thereof of an antibody that specifically binds an epitope 

20 contained within amino acids 1-105 of the BHRF1 protein (SEQ ID NO: 1), an isolated 
nucleic acid, comprising a nucleotide sequence that encodes an immunoglobulin light 
chain or fragment thereof of an antibody or fragment thereof of an antibody that 
specifically binds an epitope contained within amino acids 106-375 of the BHRF1 
protein (SEQ ID NO: 2) and an isolated nucleic acid, comprising a nucleotide sequence 

25 that encodes an immunoglobulin light chain or fragment thereof of an antibody or 
fragment thereof of an antibody that specifically binds an epitope contained within 
amino acids 1-375 of the BHRF1 protein (SEQ ID NO: 8). 

Vectors comprising any of the nucleic acids comprising a nucleotide sequence 
that encodes an immunoglobulin light chain or a fragment thereof are also 

30 contemplated by this invention as are host cells comprising such vectors. Suitable 

vectors and host cells for expression of the immunoglobulin light chains of the present 
invention are described above. 

Further provided by this invention is a purified polypeptide, comprising an 
amino acid sequence of an immunoglobulin heavy chain or a fragment thereof of an 
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antibody that binds an epitope contained within amino acids 1-105 of the BHRF1 
protein (SEQ ID NO: 1), a purified polypeptide, comprising an amino acid sequence of 
an immunoglobulin heavy chain or a fragment thereof of an antibody that binds an 
epitope contained within amino acids 106-375 of the BHRF1 protein (SEQ ID NO: 2) 
5 and a purified polypeptide, comprising an amino acid sequence of an immunoglobulin 
heavy chain or a fragment thereof of an antibody that binds an epitope contained within 
amino acids 1-375 of the BHRF1 protein (SEQ ID NO: 8). 

Methods of obtaining polypeptides comprising an amino acid sequence of an 
immunoglobulin heavy chain and polypeptides comprising an amino acid sequence of 

10 an immunoglobulin heavy chain are described above. 

Further provided by this invention is a purified polypeptide, comprising an 
amino acid sequence of an immunoglobulin light chain or a fragment thereof of an 
antibody that binds an epitope contained within amino acids 1-105 of the BHRF1 
protein (SEQ ID NO: 1), a purified polypeptide, comprising an amino acid sequence of 

15 an immunoglobulin light chain or a fragment thereof of an antibody that binds an 

epitope contained within amino acids 106-375 of the BHRF1 protein (SEQ ID NO: 2) 
and a purified polypeptide, comprising an amino acid sequence of an immunoglobulin 
light chain or a fragment thereof of an antibody that binds an epitope contained within 
amino acids 1-375 of the BHRF1 protein (SEQ ID NO: 8). 

20 Methods of obtaining polypeptides comprising an amino acid sequence of an 

immunoglobulin light chain and polypeptides comprising an amino acid sequence of an 
immunoglobulin light chain are described above. 

In vitro methods are also suitable for preparing monovalent antibodies. 
Digestion of antibodies to produce fragments thereof, particularly, Fab fragments, can 

25 be accomplished using routine techniques known in the art. For instance, digestion can 
be performed using papain. Examples of papain digestion are described in WO 
94/29348 published Dec. 22, 1994, U.S. Pat. No. 4,342,566, and Harlow and Lane, 
Antibodies, A Laboratory Manual, Cold Spring Harbor Publications, New York, 
(1988). Papain digestion of antibodies typically produces two identical antigen binding 

30 fragments, called Fab fragments, each with a single antigen binding site, and a residual 
Fc fragment. Pepsin treatment yields a fragment, called the F(ab')2 fragment, that has 
two antigen combining sites and is still capable of cross-linking antigen. 

The Fab fragments produced in the antibody digestion also contain the constant 
domains of the light chain and the first constant domain of the heavy chain. Fab' 
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fragments differ from Fab fragments by the addition of a few residues at the carboxy 
terminus of the heavy chain domain including one or more cysteines from the antibody 
hinge region. The F(ab')2 fragment is a bivalent fragment comprising two Fab' 
fragments linked by a disulfide bridge at the hinge region. Fab'-SH is the designation 
5 herein for Fab' in which the cysteine residue(s) of the constant domains bear a free thiol 
group. Antibody fragments originally were produced as pairs of Fab' fragments which 
have hinge cysteines between them. Other chemical couplings of antibody fragments 
are also known. 

An isolated immunogenically specific paratope or fragment of the antibody is 

10 also provided. A specific immunogenic epitope of the antibody can be isolated from 

the whole antibody by chemical or mechanical disruption of the molecule. The purified 
fragments thus obtained are tested to determine their immunogenicity and specificity by 
the methods taught herein. Immunoreactive paratopes of the antibody, optionally, are 
synthesized directly. An immunoreactive fragment is defined as an amino acid 

15 sequence of at least about two to five consecutive amino acids derived from the 
antibody amino acid sequence. 

One method of producing proteins comprising the antibodies of the present 
invention is to link two or more peptides or polypeptides together by protein chemistry 
techniques. For example, peptides or polypeptides can be chemically synthesized using 

20 currently available laboratory equipment using either Fmoc (9- 

fluorenylmethyloxycarbonyl) or Boc (tert -butyloxycarbonoyl) chemistry. (Applied 
Biosystems, Inc., Foster City, CA). One skilled in the art can readily appreciate that a 
peptide or polypeptide corresponding to the antibody of the present invention, for 
example, can be synthesized by standard chemical reactions. For example, a peptide or 

25 polypeptide can be synthesized and not cleaved from its synthesis resin, whereas the 
other fragment of an antibody can be synthesized and subsequently cleaved from the 
resin, thereby exposing a terminal group which is functionally blocked on the other 
fragment. By peptide condensation reactions, these two fragments can be covalently 
joined via a peptide bond at their carboxyl and amino termini, respectively, to form an 

30 antibody, or fragment thereof. (Grant GA (1992) Synthetic Peptides: A User Guide. 
W.H. Freeman and Co., N.Y. (1992); Bodansky M and Trost B., Ed. (1993) Principles 
of Peptide Synthesis. Springer- Verlag Inc., NY. Alternatively, the peptide or 
polypeptide is independently synthesized in vivo as described above. Once isolated, 
these independent peptides or polypeptides may be linked to form an antibody or 
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fragment thereof via similar peptide condensation reactions. 

For example, enzymatic ligation of cloned or synthetic peptide segments allow 
relatively short peptide fragments to be joined to produce larger peptide fragments, 
polypeptides or whole protein domains (Abrahmsen L et al., Biochemistry, 30:4151 
5 (1991)). Alternatively, native chemical ligation of synthetic peptides can be utilized to 
synthetically construct large peptides or polypeptides from shorter peptide fragments. 
This method consists of a two-step chemical reaction (Dawson et al. Synthesis of 
Proteins by Native Chemical Ligation. Science, 266:776-779 (1994)). The first step is 
the chemoselective reaction of an unprotected synthetic peptide-alpha-thioester with 

1 0 another unprotected peptide segment containing an amino-terminal Cys residue to give 
a thioester-linked intermediate as the initial covalent product. Without a change in the 
reaction conditions, this intermediate undergoes spontaneous, rapid intramolecular 
reaction to form a native peptide bond at the ligation site. Application of this native 
chemical ligation method to the total synthesis of a protein molecule is illustrated by 

1 5 the preparation of human interleukin 8 (IL-8) (Baggiolini M et al. (1 992) FEBS Lett. 
307:97-101; Clark-Lewis I et al., J.Biol.Chem., 269:16075 (1994); Clark-Lewis I et al., 
Biochemistry, 30:3128 (1991); Rajarathnam K et al., Biochemistry 33:6623-30 (1994)). 

Alternatively, unprotected peptide segments are chemically linked where the 
bond formed between the peptide segments as a result of the chemical ligation is an 

20 unnatural (non-peptide) bond (Schnolzer, M et al. Science, 256:221 (1992)). This 
technique has been used to synthesize analogs of protein domains as well as large 
amounts of relatively pure proteins with full biological activity (deLisle Milton RC et 
al., Techniques in Protein Chemistry IY. Academic Press, New York, pp. 257-267 
(1992)). 

25 The invention also provides fragments of antibodies which have bioactivity. 

The polypeptide fragments of the present invention can be recombinant proteins 
obtained by cloning nucleic acids encoding the polypeptide in an expression system 
capable of producing the polypeptide fragments thereof, such as an adenovirus or 
baculovirus expression system. For example, one can determine the active domain of 

30 an antibody from a specific hybridoma that can cause a biological effect associated 
with the interaction of the antibody with BHRF1 protein. For example, amino acids 
found not to contribute to either the activity or the binding specificity or affinity of the 
antibody can be deleted without a loss in the respective activity. For example, in 
various embodiments, amino or carboxy-terminal amino acids are sequentially removed 
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from either the native or the modified non-immunoglobulin molecule or the 
immunoglobulin molecule and the respective activity assayed in one of many available 
assays. In another example, a fragment of an antibody comprises a modified antibody 
wherein at least one amino acid has been substituted for the naturally occurring amino 
5 acid at a specific position, and a portion of either amino terminal or carboxy terminal 
amino acids, or even an internal region of the antibody, has been replaced with a 
polypeptide fragment or other moiety, such as biotin, which can facilitate in the 
purification of the modified antibody. For example, a modified antibody can be fused 
to a maltose binding protein, through either peptide chemistry or cloning the respective 

10 nucleic acids encoding the two polypeptide fragments into an expression vector such 
that the expression of the coding region results in a hybrid polypeptide. The hybrid 
polypeptide can be affinity purified by passing it over an amylose affinity column, and 
the modified antibody receptor can then be separated from the maltose binding region 
by cleaving the hybrid polypeptide with the specific protease factor Xa. (See, for 

15 example, New England Biolabs Product Catalog, 1996, pg. 164.). Similar purification 
procedures are available for isolating hybrid proteins from eukaryotic cells as well. 

The fragments, whether attached to other sequences or not, include insertions, 
deletions, substitutions, or other selected modifications of particular regions or specific 
amino acids residues, provided the activity of the fragment is not significantly altered 

20 or impaired compared to the nonmodified antibody or antibody fragment. These 
modifications can provide for some additional property, such as to remove or add 
amino acids capable of disulfide bonding, to increase its bio-longevity, to alter its 
secretory characteristics, etc. In any case, the fragment must possess a bioactive 
property, such as binding activity, regulation of binding at the binding domain, etc. 

25 Functional or active regions of the antibody may be identified by mutagenesis of a 
specific region of the protein, followed by expression and testing of the expressed 
polypeptide. Such methods are readily apparent to a skilled practitioner in the art and 
can include site-specific mutagenesis of the nucleic acid encoding the antigen. (Zoller 
MJ et al. Nucl. Acids Res. 10:6487-500 (1982). 

30 A variety of immunoassay formats may be used to select antibodies that 

selectively bind with a particular protein, variant, or fragment. For example, solid- 
phase ELISA immunoassays are routinely used to select antibodies selectively 
immunoreactive with a protein, protein variant, or fragment thereof. See Harlow and 
Lane. Antibodies, A Laboratory Manual. Cold Spring Harbor Publications, New York, 
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(1988), for a description of immunoassay formats and conditions that could be used to 
determine selective binding. The binding affinity of a monoclonal antibody can, for 
example, be determined by the Scatchard analysis of Munson et al., Anal. Biochem., 
107:220(1980). 

5 Also provided is an antibody reagent kit comprising containers of a monoclonal 

antibody or fragment thereof of the invention and one or more reagents for detecting 
binding of the antibody or fragment thereof to a BHRF1 polypeptide. The reagents can 
include, for example, fluorescent tags, enzymatic tags, or other tags. The reagents can 
also include secondary or tertiary antibodies or reagents for enzymatic reactions, 
10 wherein the enzymatic reactions produce a product that can be visualized. 

Nucleic Acid Detection Methods 

As mentioned above, using Arbitrary-Primed PCR (AP-PCR), mutations were 
found in over 80% of the breast tumor samples analyzed as compared with the 

15 uninvolved breast tissue from the same subject. Further characterization by cloning and 
sequencing of this genomic region harboring mutation revealed that it represents a part 
of a novel gene named BHRF1. The mutation present in breast tumor samples is a 
1270 bp insert located in chromosomal region 17qll.2. Therefore, the present 
invention also provides methods of detecting the genomic region harboring this 

20 mutation as well as methods of detecting BHRF1 transcripts and fragments thereof. 
These methods can be utilized to detect cancer in a subject. 

The term cancer, when used herein refers to or describes the physiological 
condition, preferably in a mamalian subject, that is typically characterized by 
unregulated cell growth. Examples of cancer include but are not limited to ras-induced 

25 cancers, colorectal cancer, carcinoma, lymphoma, sarcoma, blastoma and leukemia. 
More particular examples of such cancers include squamous cell carcinoma, lung 
cancer, pancreatic cancer, cervical cancer, bladder cancer, hepatoma, breast cancer, 
estrogen dependent breast cancer, prostrate carcinoma, rhabdomyosarcoma, colon 
carcinoma, ovarian cancer and head and neck cancer. 

30 The sample of this invention can be from any organism and can be, but is not 

limited to, peripheral blood, bone marrow specimens, primary tumors, embedded tissue 
sections, frozen tissue sections, cell preparations, cytological preparations, exfoliate 
samples (e.g., sputum), fine needle aspirations, lung fluid, amnion cells, fresh tissue, 
dry tissue, and cultured cells or tissue. The sample can be from malignant tissue or 
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non-malignant tissue. The sample can be unfixed or fixed according to standard 
protocols widely available in the art and can also be embedded in a suitable medium for 
preparation of the sample. For example, the sample can be embedded in paraffin or 
other suitable medium (e.g., epoxy or acrylamide) to facilitate preparation of the 
5 biological specimen for the detection methods of this invention. Furthermore, the 
sample can be embedded in any commercially available mounting medium, either 
aqueous or organic. 

The sample can be on, supported by, or attached to, a substrate which facilitates 
detection. A substrate of the present invention can be, but is not limited to, a 

1 0 microscope slide, a culture dish, a culture flask, a culture plate, a culture chamber, 
ELIS A plates, as well as any other substrate that can be used for containing or 
supporting biological samples for analysis according to the methods of the present 
invention. The substrate can be of any material suitable for the purposes of this 
invention, such as, for example, glass, plastic, polystyrene, mica and the like. The 

1 5 substrates of the present invention can be obtained from commercial sources or 
prepared according to standard procedures well known in the art. 

Conversely, an antibody or fragment thereof, an antigenic fragment of BHRF1, 
or BHRF 1 nucleic acid of the invention can be on, supported by, or attached to a 
substrate which facilitates detection. Such a substrate can include a chip, a microarray 

20 or a mobile solid support. Thus, provided by the invention are substrates including one 
or more of the antibodies or antibody fragments, antigenic fragments of BHRF 1 
polypeptides, or BHRF1 nucleic acids of the invention. 

The nucleic acids of the present invention can be utilized as probes or primers to 
detect BHRF1 nucleic acids. For example, a polynucleotide probe or primer 

25 comprising a polynucleotide selected from the group consisting of at least 25 

contiguous nucleotides of SEQ ID NO: 3, at least 25 contiguous nucleotides of SEQ ID 
NO: 4, at least 25 contiguous nucleotides from SEQ ID NO: 5 and at least 25 
contiguous nucleotides of SEQ ID NO: 9 can be utilized to detect a BHRF1 nucleic 
acid. Therefore, the polynucleotide probes or primers of this invention can be at least 

30 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 105, 110, 115, 120, 125, 
130, 135, 140, 145, 150, 155, 160, 165, 170, 175, 180, 185, 190, 195 or at least 200 
nucleotides in length. 

Further provided by the present invention is a method for detecting the presence 
of cancer in a subject, comprising: (a)contacting a biological sample obtained from a 
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subject with at least two oligonucleotide primers, each primer consisting of 10 to 200 
contiguous nucleotides of SEQ ID NO: 5 or the complement therof, in a reverse 
transcriptase polymerase chain reaction; and (b) detecting in the sample a 
polynucleotide sequence that amplifies in the presence of said oligonucleotide primers, 
5 wherein the presence of an amplified polynucleotide sequence indicates the presence of 
cancer in the subject. 

Also provided by the present invention is a method for detecting the presence of 
cancer in a subject, comprising: (a) contacting a biological sample obtained from a 
subject with a nucleic acid probe under conditions that allow the probe to selectively 
10 bind a BRHF1 nucleic acid; and (b) detecting the presence of a BRHF1 nucleic acid, 
whereby the presence of a BRHF1 nucleic acid indicates the presence of cancer in the 
subject. 

As used herein, the term "nucleic acid probe" refers to a nucleic acid fragment 
that selectively hybridizes under stringent conditions with a nucleic acid comprising a 

15 nucleic acid set forth in a sequence listed herein. This hybridization must be specific. 
The degree of complementarity between the hybridizing nucleic acid and the sequence 
to which it hybridizes should be at least enough to exclude hybridization with a nucleic 
acid encoding an unrelated protein. 

Stringent conditions refers to the washing conditions used in a hybridization 

20 protocol. In general, the washing conditions should be a combination of temperature 
and salt concentration chosen so that the denaturation temperature is approximately 
5-20°C below the calculated T m of the nucleic acid hybrid under study. The 
temperature and salt conditions are readily determined empirically in preliminary 
experiments in which samples of reference DNA immobilized on filters are hybridized 

25 to the probe or protein coding nucleic acid of interest and then washed under conditions 
of different stringencies. The T m of such an oligonucleotide can be estimated by 
allowing 2°C for each A or T nucleotide, and 4°C for each G or C. For example, an 18 
nucleotide probe of 50% G+C would, therefore, have an approximate T m of 54°C. 
Stringent conditions are known to one of skill in the art. See, for example, 

30 Sambrook et al. (2001). An example of stringent wash conditions is 4 X SSC at 65 °C. 
Highly stringent wash conditions include, for example, 0.2 X SSC at 65 °C. 

A number of methods are available for analyzing nucleic acids for the presence 
of a specific sequence. For all of the methods described herein, genomic DNA can be 
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extracted from a sample and this sample can be from any organism and can be, but is 
not limited to, peripheral blood, bone marrow specimens, primary tumors, embedded 
tissue sections, frozen tissue sections, cell preparations, cytological preparations, 
exfoliate samples (e.g., sputum), fine needle aspirations, amnion cells, fresh tissue, dry 
5 tissue, and cultured cells or tissue. Such samples can be obtained directly from a 
subject, commercially obtained or obtained via other means. Thus, the invention 
described herein can be utilized to analyze a nucleic acid sample that comprises 
genomic DNA, amplified DNA( such as a PCR product) cDNA, cRNA, a restriction 
fragment or any other desired nucleic acid sample. When one performs one of the 
10 herein described methods on genomic DNA, typically the genomic DNA will be treated 
in a manner to reduce viscosity of the DNA and allow better contact of a primer or 
probe with the target region of the genomic DNA. Such reduction in viscosity can be 
achieved by any desired methods, which are known to the skilled artisan, such as 
DNase treatment or shearing of the genomic DNA, preferably lightly. 

15 If sufficient DNA is available, genomic DNA can be used directly. 

Alternatively, the region of interest is cloned into a suitable vector and grown in 
sufficient quantity for analysis. The nucleic acid may be amplified by conventional 
techniques, such as the polymerase chain reaction (PCR), to provide sufficient amounts 
for analysis. A variety of PCR techniques are familiar to those skilled in the art. For a 

20 review of PCR technology, see White (1997) and the publication entitled "PCR 

Methods and Applications" (1991, Cold Spring Harbor Laboratory Press), which is 
incorporated herein by reference in its entirety for amplification methods. In each of 
these PCR procedures, PCR primers on either side of the nucleic acid sequences to be 
amplified are added to a suitably prepared nucleic acid sample along with dNTPs and a 

25 thermostable polymerase such as Taq polymerase, Pfu polymerase, or Vent 

polymerase. The nucleic acid in the sample is denatured and the PCR primers are 
specifically hybridized to complementary nucleic acid sequences in the sample. The 
hybridized primers are extended. Thereafter, another cycle of denaturation, 
hybridization, and extension is initiated. The cycles are repeated multiple times to 

30 produce an amplified fragment containing the nucleic acid sequence between the 

primer sites. PCR has further been described in several patents including U.S. Pat. Nos. 
4,683,195, 4,683,202 and 4,965,188. Each of these publications is incorporated herein 
by reference in its entirety for PCR methods. One of skill in the art would know how to 
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design and synthesize primers that amplify SEQ ID NO: 5 or a fragment thereof. For 
example, the present invention provides forward primer (5'-CAGAGCCTGT-3') (SEQ 
ID NO: 6) and reverse primer (5'-CTCTGGGACA-3') (SEQ ID NO: 7) which can be 
utilized to amplify the genomic region of chromosome 17 comprising an insertion of 
5 approximately 1270 base pairs associated with cancer. 

A detectable label may be included in an amplification reaction. Suitable labels 
include fluorochromes, e.g. fluorescein isothiocyanate (FITC), rhodamine, Texas Red, 
phycoerythrin, allophycocyanin, 6-carboxyfluorescein (6-FAM), 2',7'-dimethoxy-4',5'- 
dichloro-6-carboxyfluorescein (JOE), 6-carboxy-X-rhodamine (ROX), 6-carboxy- 

10 2',4',7 , ,4,7-hexachlorofluorescein (HEX), 5-carboxyfluorescein (5-FAM) or N,N,N',N'- 
tetramethyl-6-carboxyrhodamine (TAMRA), radioactive labels, e.g., 32 P, 35 S, 3 H- etc. 
The label may be a two stage system, where the amplified DNA is conjugated to biotin, 
haptens, etc. having a high affinity binding partner, e.g. avidin, specific antibodies, etc., 
where the binding partner is conjugated to a detectable label. The label may be 

15 conjugated to one or both of the primers. Alternatively, the pool of nucleotides used in 
the amplification is labeled, so as to incorporate the label into the amplification 
product. 

The sample nucleic acid, e.g. amplified or cloned fragment, can be analyzed by 
one of a number of methods known in the art. The nucleic acid can be sequenced by 
20 dideoxy or other methods. Hybridization with the sequence can also be used to 
determine its presence, by Southern blots, dot blots, etc. 

The BHRF1 nucleic acids of the invention can also be used in polynucleotide 
arrays. Polynucleotide arrays provide a high throughput technique that can assay a large 
number of polynucleotide sequences in a single sample. This technology can be used, 
25 for example, as a diagnostic tool to identify metastatic lesions or to assess the 
metastatic potential of a tumor. 

To create arrays, single-stranded polynucleotide probes can be spotted onto a 
substrate in a two-dimensional matrix or array. Each single-stranded polynucleotide 
probe can comprise at least 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 25, or 30 
30 or more contiguous nucleotides selected from the nucleotide sequences shown in SEQ 
ID NOS:2, 4 and 5. 

The substrate can be any substrate to which polynucleotide probes can be 
attached, including but not limited to glass, nitrocellulose, silicon, and nylon. 
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Polynucleotide probes can be bound to the substrate by either covalent bonds or by 
non-specific interactions, such as hydrophobic interactions. Techniques for constructing 
arrays and methods of using these arrays are described in EP No. 0 799 897; PCT No. 
WO 97/29212; PCT No. WO 97/27317; EP No. 0 785 280; PCT No. WO 97/02357; 
5 U.S. Pat. Nos. 5,593,839; 5,578,832; EP No. 0 728 520; U.S. Pat. No. 5,599,695; EP 
No. 0 721 016; U.S. Pat. No. 5,556,752; PCT No. WO 95/22058; and U.S. Pat. No. 
5,631,734. Commercially available polynucleotide arrays, such as Affymetrix 
GeneChip.TM., can also be used. Use of the GeneChip.™. to detect gene expression is 
described, for example, in Lockhart et al., Nature Biotechnology 14:1675 (1996); Chee 
10 et al., Science 274:610 (1996); Hacia et al., Nature Genetics 14:441, 1996; and Kozal et 
al., Nature Medicine 2:753, 1996. 

Tissue samples which are suspected of being metastatic or the metastatic 
potential of which is unknown can be treated to form single-stranded polynucleotides, 
for example by heating or by chemical denaturation, as is known in the art. The single- 

15 stranded polynucleotides in the tissue sample can then be labeled and hybridized to the 
polynucleotide probes on the array. Detectable labels which can be used include but are 
not limited to radiolabels, biotinylated labels, fluorophors, and chemiluminescent 
labels. Double stranded polynucleotides, comprising the labeled sample 
polynucleotides bound to polynucleotide probes, can be detected once the unbound 

20 portion of the sample is washed away. Detection can be visual or with computer 
assistance. 

Further provided by the present invention is a method of determining the stage 
of cancer in a subject comprising: (a) contacting a biological sample obtained from a 
subject with a nucleic acid probe under conditions that allow the probe to selectively 

25 bind a BRHF1 nucleic acid, (b) detecting the presence of a BRHF1 nucleic acid and 
correlating the amount of BHRF1 nucleic acids in the sample with a clinically defined 
stage of cancer, thereby determining the stage of cancer in the subject. 

Since therapy and clinical decisions are often dependent on diagnosis and 
staging, BHRF 1 detection with BHRF1 probes allows correlation of BHRFT nucleic 

30 acid levels with a particular stage of cancer. One skilled in the art would be able to 

measure BHRF1 nucleic acid levels (for example, mRNA levels) in numerous subjects 
in order to establish ranges of BHRF 1 levels that correspond to clinically defined stages 
of cancer. The process of determining the clinical stages of cancer would be known to 
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one of skill in the art. These ranges will allow the skilled practitioner to measure 
BHRF1 nucleic acid levels in a subject diagnosed with cancer and correlate the levels 
in each subject with a range that corresponds to a stage of cancer. Any of the detection 
methods described herein can be utilized with other detection methods, including, but 
5 not limited to, visual inspection, histopathological examination, radiological 

examination, biopsy and MRI to confirm the presence of cancer, to confirm the type of 
cancer and/or to determine a clinical stage of cancer. 

A person skilled in the art would know that the methods of this invention can 
also be utilized to test the efficacy of an anti-cancer treatment. For example, if the 

10 patient diagnosed with cancer is assayed for BHRF1 nucleic acid levels prior to the 
administration of an anti-cancer treatment and assayed at a second time point after the 
administration of the anti-cancer treatment, a decrease in BHRF1 nucleic acid levels 
may indicate an effective anti-cancer treatment had been administered. The skilled 
practitioner will associate the decreases observed with a particular level of 

15 effectiveness. If no decrease is observed, the anti-cancer treatment may need to be 
adjusted. 

Furthermore, since the increased expression of BHRF1 is associated with 
estrogen dependent breast cancers, upon detection of increased levels of BHRF1 or 
correlation of BHRF1 levels with a particular stage of breast cancer, the skilled 
20 practitioner can administer a therapy suited for the treatment of estrogen-dependent 
breast cancer. 

The patients of this invention undergoing anti-cancer therapy can include 
patients undergoing surgery, chemotherapy, radiotherapy, immunotherapy or any 
combination thereof. Examples of chemotherapeutic agents include cisplatin, 5- 
25 fluorouracil and S-l. Immunotherapeutics methods include administration of 
interleukin-2 and interferon-a. 

Antibody Detection Methods 

The present invention provides a method of detecting the presence of cancer in 
a subject comprising: a) contacting a sample from the subject with an antibody to a 
30 BRHF1 polypeptide; and b) detecting the antibody bound to the BRHF1 polypeptide in 
the sample, wherein binding of BRHF1 polypeptide to the antibody indicates the 
presence of a BRHF1 polypeptide in the sample, the presence of a BRHF1 polypeptide 
indicating the presence of cancer in the subject. 
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Any of the detection methods described herein can be utilized to detect any 
BHRF1 polypeptide and can be utilized for initial diagnosis of cancer as well as in 
methods of determining the progression of cancer in a subject or determining the 
efficacy of treatment. The antibody, or fragment thereof, can be linked to a detectable 
5 label either directly or indirectly through use of a secondary and/or tertiary antibody; 
thus, bound antibody, fragment or molecular complex can be detected directly in an 
ELISA or similar assay. 

The sample can be on, supported by, or attached to, a substrate which facilitates 
detection. A substrate of the present invention can be, but is not limited to, a 

1 0 microscope slide, a culture dish, a culture flask, a culture plate, a culture chamber, 
ELISA plates, as well as any other substrate that can be used for containing or 
supporting biological samples for analysis according to the methods of the present 
invention. The substrate can be of any material suitable for the purposes of this 
invention, such as, for example, glass, plastic, polystyrene, mica and the like. The 

15 substrates of the present invention can be obtained from commercial sources or 
prepared according to standard procedures well known in the art. 

Conversely, an antibody or fragment thereof, an antigenic fragment of a BHRF1 
polypeptide can be on, supported by, or attached to a substrate which facilitates 
detection. Such a substrate can be a mobile solid support. Thus, provided by the 

20 invention are substrates including one or more of the antibodies or antibody fragments, 
or antigenic fragments of a BHRF1 polypeptide. 

Further provided by the present invention is a method of determining the stage 
cancer in a subject comprising: a) contacting a sample from the subject with an 
antibody to a BRHF1 polypeptide, b) detecting the antibody bound to the BRHF1 

25 polypeptide in the sample, and correlating the amount of bound antibody in the sample 
with a clinically defined stage of cancer, thereby determining the stage of cancer in the 
subject. 

Since therapy and clinical decisions are often dependent on diagnosis, BHRF 1 
detection with this antibody or fragment thereof allows correlation of BHRF 1 
30 polypeptide levels with a particular stage of cancer. One skilled in the art would be 
able to measure BHRF1 levels in numerous subjects in order to establish ranges of 
BHRF1 levels that correspond to clinically defined stages of cancer. The process of 
determining the clinical stages of cancer would be known to one of skill in the art. 
These ranges will allow the skilled practitioner to measure BHRF1 levels in a subject 
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diagnosed with cancer and correlate the levels in each subject with a range that 
corresponds to a stage of cancer. 

A person skilled in the art would know that the methods of this invention can 
also be utilized to test the efficacy of an anti-cancer treatment. For example, if the 
5 patient diagnosed with cancer is assayed for BHRF1 levels prior to the administration 
of an anti-cancer treatment and assayed at a second time point after the administration 
of the anti-cancer treatment, a decrease in BHRF1 levels may indicate an effective anti- 
cancer treatment had been administered. The skilled practitioner will associate the 
decreases observed with a particular level of effectiveness. If no decrease is observed, 
1 0 the anti-cancer treatment may need to be adjusted. 

Furthermore, since the increased expression of BHRF1 is associated with 
estrogen dependent breast cancers, upon detection of increased levels of BHRF1 or 
correlation of BHRF1 levels with a particular stage of breast cancer, the skilled 
practitioner can administer a therapy suited for the treatment of estrogen-dependent 
1 5 breast cancer. 

The patients of this invention undergoing anti-cancer therapy can include 
patients undergoing surgery, chemotherapy, radiotherapy, immunotherapy or any 
combination thereof. Examples of chemotherapeutic agents include cisplatin, 5- 
fluorouracil and S-l . Immunotherapeutics methods include administration of 
20 interleukin-2 and interferon-a. 

Inhibition of BHRF1 

The present invention provides a method of reducing BRHF1 expression in a 
25 cell comprising administering to the cell an antisense oligonucleotide that specifically 
binds to mRNA transcribed from the BRHF1 gene under conditions that allow 
hybridization, wherein the BRHF1 mRNA comprises a nucleotide sequence selected 
from the group consisting of at least 15 contiguous nucleotides from SEQ ID NO: 3, at 
least 1 5 contiguous nucleotides from SEQ ID NO 4 and at least 1 5 contiguous 
30 nucleotides from SEQ ID NO: 9, and wherein hybridization of the antisense 

oligonucleotide with the BRHF1 mRNA reduces BRHF1 expression. For example, a 
sense strand sequence derived from SEQ ID NO: 3 (5'tgagcacaactattccgatc3') (SEQ ED 
NO: 10) can be utilized as a BRHF-1 specific antisense oligonucleotide. As another 
example, a sense strand sequence derived from SEQ ID NO: 4 
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(5'aagcaacttcagcaaagtctcag 3') (SEQ ID NO: 1 1) can be utilized as a BRHF-1 specific 
antisense oligonucleotide. 

As used herein, "reducing" or "inhibiting" means a decrease in a BHRF1 
expression levels. Such reduction does not have to be complete and can range from a 
5 slight decrease in BRHF1 expression levels to complete inhibition of BRHF1 
expression levels. 

Antisense technology is well known in the art and describes a mechanism 
whereby a nucleic acid comprising a nucleotide sequence which is in a complementary, 
"antisense" orientation with respect to a coding or "sense" sequence of an endogenous 

1 0 gene, is introduced into a cell, whereby a duplex may form between the antisense 
sequence and its complementary sense sequence. The formation of this duplex may 
result in inactivation of the endogenous gene. 

For example, the antisense nucleic acid can inhibit gene expression by forming 
an RNA/RNA duplex between the antisense RNA and the RNA transcribed from a 

15 target gene. The precise mechanism by which this duplex formation decreases the 

production of the protein encoded by the endogenous gene most likely involves binding 
of complementary regions of the normal sense mRNA and the antisense RNA strand 
with duplex formation in a manner that blocks RNA processing and translation. 
Alternative mechanisms include the formation of a triplex between the antisense RNA 

20 and duplex DNA or the formation of a DNA-RNA duplex with subsequent degradation 
of DNA-RNA hybrids by RNAse H. Furthermore, an antisense effect can result from 
certain DNA-based oligonucleotides via triple-helix formation between the oligomer 
and double-stranded DNA which results in the repression of gene transcription. 
Antisense nucleic acid can be produced for any relevant endogenous gene for which the 

25 coding sequence has been or can be determined according to well known methods. 

A nucleic acid encoding an antisense RNA can be selected based on the protein 
desired to be inhibited or decreased in cells, by providing an RNA that will selectively 
bind to the cellular mRNA encoding such protein. Binding of the antisense molecule to 
the target mRNA may incapacitate the mRNAs, thus preventing its translation into a 

30 functional protein. The antisense RNA/mRNA complexes can then become a target for 
RNAse-H and are eventually degraded by the host cell RNAse-H. Control regions, 
such as enhancers and promoters, can be selected for antisense RNA targeting 
according to the cell or tissue in which it is to be expressed, as is known in the art. 
Preferable antisense-encoding constructs can encode full-length complements to target 
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sequences; however, smaller length sequences down to oligonucleotide size can be 
utilized. For example, the antisense-encoding constructs can encode full-length 
complements to the BHRF1 gene, smaller length sequences or oligonucleotide 
sequences. 

5 The present invention also provides a method of reducing BRHF1 expression 

comprising administering to a cell a ribozyme that specifically binds to mRNA 
transcribed from the BRHF1 gene, the ribozyme binding reducing BRHF1 expression. 

Therefore, when BFfRFl expression can be decreased using a ribozyme, an 
RNA molecule with catalytic activity. See, e.g., Cech, 1987, Science 236: 1532-1539; 

10 Cech, 1990, Ann. Rev. Biochem. 59:543-568; Cech, 1992, Curr. Opin. Struct. Biol. 2: 
605-609; Couture and Stinchcomb, 1996, Trends Genet. 12: 510-515. Ribozymes can 
be used to inhibit gene function by cleaving an RNA sequence, as is known in the art 
(e.g., Haseloff et al., U.S. Pat. No. 5,641,673). 

The coding sequence of BFfRFl can be used to generate a ribozyme which will 

15 specifically bind to mRNA transcribed from a BHRF1 gene. Methods of designing and 
constructing ribozymes which can cleave other RNA molecules in trans in a highly 
sequence specific manner have been developed and described in the art (see Haseloff et 
al. (1988), Nature 334:585-591). For example, the cleavage activity of ribozymes can 
be targeted to specific RNAs by engineering a discrete "hybridization" region into the 

20 ribozyme. The hybridization region contains a sequence complementary to the target 
RNA and thus specifically hybridizes with the target (see, for example, Gerlach et al., 
EP 321,201). Longer complementary sequences can be used to increase the affinity of 
the hybridization sequence for the target. The hybridizing and cleavage regions of the 
ribozyme can be integrally related; thus, upon hybridizing to the target RNA through 

25 the complementary regions, the catalytic region of the ribozyme can cleave the target. 

Ribozymes can be introduced into cells as part of a DNA construct, as is known 
in the art. The DNA construct can also include transcriptional regulatory elements, such 
as a promoter element, an enhancer or UAS element, and a transcriptional terminator 
signal, for controlling the transcription of the ribozyme in the cells. 

30 Mechanical methods, such as microinjection, liposome-mediated transfection, 

electroporation, or calcium phosphate precipitation, can be used to introduce the 
ribozyme-containing DNA construct into cells whose division it is desired to decrease, 
as described above. Alternatively, if it is desired that the DNA construct be stably 
retained by the cells, the DNA construct can be supplied on a plasmid and maintained 
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as a separate element or integrated into the genome of the cells, as is known in the art. 

As taught in Haseloff et al., U.S. Pat. No. 5,641,673, the ribozyme can be 
engineered so that its expression will occur in response to factors which induce 
expression of BHRF1. The ribozyme can also be engineered to provide an additional 
5 level of regulation, so that destruction of mRNA occurs only when both the ribozyme 
and the BHRF1 gene are induced in the cells. 

Further provided by the present invention is a method of reducing BRHF1 
expression comprising administering to a cell an siRNA that is complementary to at 
least a portion of the coding sequence of BRHF1, under conditions that allow 

10 hybridization of the siRNA with the BRHF1 coding sequence, wherein the BRHF1 

coding sequence comprises a nucleotide sequence selected from the group consisting of 
at least 21 contiguous nucleotides of SEQ ID NO: 3 and at least 21 contiguous 
nucleotides of SEQ ED NO: 4 and wherein the binding of the siRNA to the BRHF1 
coding sequence reduces BRHF1 expression. For example, a sense strand sequence 

15 derived from SEQ ED NO:4 (5'AAACCACTGCTCAACGAAATA3') (SEQ ID NO: 
12) can be utilized to generate a siRNA oligonucleotide template. As another example, 
a sense strand sequence derived from SEQ ID NO: 3 

(5'AAGGGATTATCGCCTATCGCC3')(SEQ ID NO: 13)can be utilized to generate a 
siRNA oligonucleotide template. 

20 Also provided by the present invention is a method of reducing the activity of a 

BHRF1 polypeptide, comprising contacting the protein or the cell with one or more 
anti-BHRFl antibodies of the present invention. Activities of BHRF1 include, but are 
not limited to, BHRF1 binding to a cell, BRHF1 binding to another polypeptide, 
BRHF1 binding to a nucleic acid or BRHF1 binding to a receptor. Other activities of 

25 BHRF1 include promoting increased cell proliferation and other activities associated 
with cancer. As used herein, "reducing" means a decrease in a BHRF1 activity as 
compared to a control. The control can include BHRF1 polypeptides or a cell 
expressing a BHRF1 polypepitde in the absence of the antibody or other agent. Thus, 
the control can include a cell or protein before or after contact with the antibody or 

30 other agent or can include a cell or protein that is not contacted with the antibody or 
agent. Such a reduction in activity does not have to be complete and can range from a 
slight decrease in BHRF1 activity to complete inhibition of BHRF1 activity. 
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Methods of inhibiting or reducing BHRF1 expression as well as methods of 
inhibiting or reducing BHRF1 activity can be utilized to treat subjects with cancer or 
other proliferative disorders. As used herein, "treating" describes an improvement in 
the patient's clinical state. The improvement may range from reduction of the 
5 symptoms of the disease to complete amelioration of the disease. 

Optimal dosages used will vary according to the individual being treated and the 
inhibitor being used. The amount of inhibitor will also vary among individuals on the 
basis of age, size, weight, condition, etc. One skilled in the art will realize that dosages 
are best optimized by the practicing physician and methods for determining dose 
10 amounts and regimens and preparing dosage forms are described, for example, in 
Remington 's Pharmaceutical Sciences. For example, suitable doses and dosage 
regimens can be determined by comparison to agents presently used in the treatment or 
prevention of inflammation or autoimmune disorders. 

Typically, the inhibitor of this invention can be administered orally or 
15 parenterally in a dosage range of 0.1 to 100 mg/kg of body weight depending on the 
clinical response that is to be obtained. Administration of inhibitor can be stopped 
completely following a prolonged remission or stabilization of disease signs and 
symptoms and readministered following a worsening of either the signs or symptoms of 
the disease. 

20 The efficacy of administration of a particular dose of inhibitor in treating cancer 

as described herein can be determined by evaluating the particular aspects of the 
medical history, the signs, symptoms and objective laboratory tests that have a 
documented utility in evaluating pathophysiological activity of the particular cancerr 
being treated. These signs, symptoms and objective laboratory tests will vary 

25 depending on the particular disorder being treated, as will be well known to any 

clinician in this field. For example, if the progression of cancer is shown to be reduced 
or ameliorated then a particular treatment can be considered efficacious. 

Once it is established that disease activity is significantly improved or stabilized 
by a particular inhibitor, specific signs, symptoms and laboratory tests can be evaluated 

30 in accordance with a reduced or discontinued treatment schedule. If a disease activity 
recurs, based on standard methods of evaluation of the particular signs, symptoms and 
objective laboratory tests as described herein, treatment can be reinitiated. 



43 



ATTORNEY DOCKET NO. 21085.0064U1 



The present invention also provides inhibitors of BHRF1 expression and/or 
activity, in a pharmaceutically acceptable carrier. By "pharmaceutically acceptable" is 
meant a material that is not biologically or otherwise undesirable, i.e., the material may 
be administered to an individual along with the selected compound without causing any 
5 undesirable biological effects or interacting in a undesirable manner with any of the 
other components of the pharmaceutical composition in which it is contained. The 
carrier may depend on the method of administration and the particular patient. It is also 
noted that not all methods of administering the inhibitors of BHRF1 described herein 
require a pharmaceutically acceptable carrier. 

10 In the present invention, the inhibitors can be orally or parenterally administered 

in a carrier pharmaceutically acceptable to human subjects. Suitable carriers for oral or 
inhaled administration can include one or more of the carriers pharmaceutically 
acceptable to human subjects. Suitable carriers for oral administration include one or 
more substances which may also act as a flavoring agents, lubricants, suspending 

15 agents, or as protectants. Suitable solid carriers include calcium phosphate, calcium 
carbonate, magnesium stearate, sugars, starch, gelatin, cellulose, 
carboxypolymethylene, or cyclodextrans. Suitable liquid carriers may be water, 
pyrogen free saline, pharmaceutically accepted oils, or a mixture of any of these. The 
liquid can also contain other suitable pharmaceutical addition such as buffers, 

20 preservatives, flavoring agents, viscosity or osmo-regulators, stabilizers or suspending 
agents. Examples of suitable liquid carriers include water with or without various 
additives, including carboxypolymethylene as a ph-regulated gel. The inhibitor may be 
contained in enteric coated capsules that release the polypeptide into the intestine to 
avoid gastric breakdown. For parenteral administration of the antagonist, a sterile 

25 solution or suspension is prepared in saline that may contain additives, such as ethyl 
oleate or isopropyl myristate, and can be injected for example, into subcutaneous or 
intramuscular tissues, as well as intravenously. 

Similarly, a nucleic acid such as an antisense nucleic acid, a ribozyme, an 
siRNA or a nucleic encoding a polypeptide inhibitor of BHRF1 can be administered. 

30 The nucleic acid encoding the polypeptide of this invention can be placed into a vector 
and delivered to the cells of a subject either in vivo or ex vivo by standard methods. 
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Screening Methods 

The present invention provides a method of identifying a compound that 
reduces BRHF1 expression, comprising administering a test compound to a cell 
containing a BRHF1 gene and detecting the level of the BRHFlgene product produced, 
5 a decrease in the gene product as compared to a control cell indicating the compound 
reduces BRHF1 expression. 

The "cells" utilized in the screening methods of this invention include any cell 
type, cancerous or noncancerous, that may express or be affected by the expression of 
a BHRF1 gene or the activity of a BHRF1 protein. Examples include, but are not 
10 limited to, breast cancer cells, colorectal cancer cells, pancreatic cancer cells, 

gastrointestinal cancer cells and prostate cancer cells. The cell could also include a cell 
that comprises an exogenous nucleic acid that encodes BHRF1. 

As utilized herein, "a BRHF1 gene product" can be, for example BRHF1 
mRNA, or a BRHF1 polypeptide. Methods of detecting a BRHF1 nucleic acid and a 
15 BRHF1 polypeptide are discussed herein. Furthermore, these methods are not limited 
to those discussed herein as there are numerous methods for detecting and measuring 
expression known to those skilled in the art. 

Further provided by this invention is a method of identifying a compound that 
reduces BRHF1 expression, comprising administering a test compound to a cell 
20 containing a nucleic acid encoding SEQ ID NO: 1 and detecting the level of the 

BRHF1 gene product produced, a decrease in the gene product indicating a compound 
that reduces BRHF1 expression. 

Also provided is a method of identifying a compound that reduces BRHF1 
expression in the presence of estrogen, comprising administering a test compound and 
25 estrogen to a cell containing a BRHF1 gene and detecting the level of the BRHF1 gene 
product produced, a decrease in the gene product as compared to a control cell 
indicating a compound that reduces BRHF1 expression in the presence of estrogen. 

The present invention also provides a method of identifying a compound that 
reduces BRHF1 expression in the presence of estrogen, comprising administering a test 
30 compound and estrogen to a cell containing a nucleic acid encoding the polypeptide of 
claim 1 and detecting the level of the BRHFlgene product produced, a decrease in the 
gene product indicating a compound that reduces BRHF1 expression in the presence of 
estrogen. 
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EXAMPLES 

A Novel BRHF1 Gene 

The present invention shows that BRHF1 gene is mutated with very high 
5 frequency (81% of the samples analyzed) in human breast tumors: Using Arbitrary- 
Primed PCR (AP-PCR) mutations in the human breast cancer genome were found. An 
insertion of an approximately 1270 bp amplification product (resolved on agarose gel) 
was observed in 81% (17 out of 21) of the tumor samples as compared with the 
uninvolved breast tissue from the same patient. The 1270 bp amplification product 
10 resolved into two bands of 1200 and 1300 bp when applied on the denaturing 
sequencing gel (Figure. 1). 

To confirm the purity (i.e., origin from a single genomic region) of this 
amplification product, the agarose gel eluted fragment was cloned, and then run on a 
sequencing gel, where it again resolved into two bands (1200 and 1300 bp), indicating 
15 that the two bands represent the two single-stranded fragments of the 1270 bp double- 
stranded DNA fragment. This suggests a semi-stable conformation in this DNA region. 

BRHF1 is located on q 112 region of chromosome 17, a highly unstable chromosome in 
breast cancer 

20 

Plasmid DNA from BAC clone HCIT 54B22 (Research Genetics, Huntsville, 
AL) that contains a BRHF-1 gene sequence was labeled with digoxigenin dUTP by 
nick translation. A specific hybridization signal was detected by incubating the 
hybridized slides in fluorescent antidigoxigenin antibodies followed by counterstaining 
25 with DAPI. Measurements in 10 cells showing a specifically labeled chromosome 17 
demonstrated that the BRHF1 gene is located at a position that corresponds to 17q 11 2 . 

BRHF1 has significant sequence homology with a known uncharacterized cDNA clone 
reported from human breast tumor 

30 

The 1270 bp AP-PCR amplification product from the breast tumor DNA was 
cloned into a PCRII vector using TA cloning kit (Invitrogen, CA). Ten clones were 
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sequenced to confirm the insert sequence, and the final sequence was compared with 
known sequences in the Gen Bank database using the BLAST program. BLAST 
analysis revealed 99% homology of this fragment with one of the genomic clones 
(HCIT 542B22, Accession number AC004253). The homology search using dbEST 
5 revealed that the tumor-specific 1270 bp amplified fragment had 93% similarity with 
one of the human EST clone (Accession # BE 065399) sequence. Interestingly, this 
EST has been reported recently from cDNA derived from human breast tumor tissue 
(Dias et al. 2000). ORF analysis predicted an ORF of 270 amino acid within this 
sequence. RT-PCR analysis confirmed that this ORF is transcriptionally active. 
1 0 Sequence analysis revealed point mutations in tumors at the primer binding site, and 
these mutations resulted in amino acid substitutions (Isoleucine [in normal] — > Valine 
[in tumor]; Aspartic acid [in normal] — > Arginine [in tumor]) in the predicted ORF. 

The BRHF1 gene transcript size is 6.50 kb and is specific to estrogen dependent breast 
15 cancer 

Northern blot analysis of the RNA isolated from the MCF7 human breast cancer 
cell line revealed a 6.5 kb transcript of the BRHF1 gene (Figure 3a). The probe used for 
the Northern blot analysis was PCR amplified from the cDNA using the primer set 

20 specific for the ORF region of the BRHF1 gene. To ensure the specificity of the probe 
DNA, the PCR product was cloned and sequenced. Our preliminary data of Northern 
blot analysis with RNA from a ER-positive breast cancer cell line (MCF7) and a ER- 
negative breast cancer cell line (MDA-MB 468) indicate that this gene is differentially 
expressed in MCF 7 and not in ER-negative MDA-MB 468 breast cancer cells (Fig. 

25 3b). 

BRHF1 transcript is multifold higher in breast tumors as compared to the matched 
uninvolved normal breast tissue 

30 Primers were designed from the ORF region of the BRHF1 gene sequence. 

Using the TRIzol method (Invitrogen), total RNA was isolated from tumors and from 
corresponding uninvolved normal breast tissues. cDNA was synthesized using the first 
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strand cDNA synthesis kit (Invitrgen). RT-PCR analysis revealed multifold increased 
expression of this gene in breast tumors as compared to uninvolved normal tissue. 

BRHF1 gene transcript is upregulated by estrogen treatment 

5 

Higher level of gene transcript level measured by RT-PCR from estrogen (17 0- 
estradiol) treated-breast cancer cells as compared with the untreated control cells 
indicate that BRHF1 gene is estrogen responsive (Fig. 5). 

In summary, the present invention provides mutations in an uncharacterized 

10 region of the breast cancer genome located at 17qll.2. This uncharacterized genomic 
region harboring mutation represents a part of a novel gene (BRHF1) and this 
BRHF1 gene harbors mutations in 81% of the sporadic breast tumor samples analyzed. 
The full-length cDNA sequence of this gene is 6.5 kb. These studies also showed that 
the BRHF1 gene is upregulated in human breast tumors as the BRHF1 gene is 

15 expressed only in the ER-positive breast cancer cells, not in ER-negative breast cancer 
cells. Furthermore, this invention showed that BRHFJ is an estrogen-responsive gene in 
breast cancer. 

Throughout this application, various publications are referenced. The 
disclosures of these publications in their entireties are hereby incorporated by reference 
20 into this application in order to more fully describe the state of the art to which this 
invention pertains. 

It will be apparent to those skilled in the art that various modifications and 
variations can be made in the present invention without departing from the scope or 
spirit of the invention. Other embodiments of the invention will be apparent to those 
25 skilled in the art from consideration of the specification and practice of the invention 
disclosed herein. It is intended that the specification and examples be considered as 
exemplary only, with a true scope and spirit of the invention being indicated by the 
following claims. 
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What is claimed is: 

1. A purified polypeptide comprising SEQ ID NO: 1. 

2. A purified polypeptide comprising an amino acid sequence at least about 
95% identical to the sequence of SEQ ID NO:l. 

3. A purified polypeptide comprising the amino acid sequence of SEQ ID 
NO:l with one or more conservative amino acid substitutions. 

4. A nucleic acid encoding the polypeptide of claim 1 . 

5. The nucleic acid of claim 4, wherein the nucleic acid comprises SEQ ID 
NO: 3 

6. A vector comprising the nucleic acid of claim 4 or 5. 

7. A host cell comprising the vector of claim 6. 

8. The host cell of claim 7, wherein the host cell is a prokaryotic cell. 

9. The host cell of claim 8, wherein the prokaryotic cell is an E. coli cell. 

10. The host cell of claim 7, wherein the host cell is a eukaryotic cell. 

1 1 . A purified polypeptide comprising SEQ ID NO: 2. 

12. A purified polypeptide comprising an amino acid sequence at least about 
95% identical to the sequence of SEQ ID NO:2. 

13. A purified polypeptide comprising the amino acid sequence of SEQ ID 
NO:2 with one or more conservative amino acid substitutions. 

14. A nucleic acid encoding the polypeptide of claim 11. 
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15. The nucleic acid of claim 14, wherein the nucleic acid comprises SEQ ID 
NO: 4 

16. A vector comprising the nucleic acid of claim 14 or 15. 

17. A host cell comprising the vector of claim 16. 

18. The host cell of claim 17, wherein the host cell is a prokaryotic cell. 

19. The host cell of claim 18, wherein the prokaryotic cell is an E. coli cell. 

20. The host cell of claim 17, wherein the host cell is a eukaryotic cell. 

2 1 . An isolated antibody or fragment thereof that specifically binds the 
polypeptide of claim 1. 

22. The antibody of claim 21, wherein the antibody is a polyclonal antibody. 

23. The antibody of claim 21, wherein the antibody is a monoclonal antibody. 

24. The antibody of claim 21, wherein the antibody is labeled with a detectable 
moiety. 

25. The antibody of claim 24, wherein the detectable moiety is selected from the 
group consisting of a fluorescent moiety, an enzyme-linked moiety, a 
biotinylated moiety and a radiolabeled moiety. 

26. The antibody of claim 21, wherein the antibody is humanized. 

27. An isolated antibody or fragment thereof that specifically binds the 
polypeptide of claim 1 1 . 

28. The antibody of claim 27, wherein the antibody is a polyclonal antibody. 
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29. The antibody of claim 27, wherein the antibody is a monoclonal antibody. 

30. The antibody of claim 27, wherein the antibody is labeled with a detectable 
moiety. 

31. The antibody of claim 30, wherein the detectable moiety is selected from the 
group consisting of a fluorescent moiety, an enzyme-linked moiety, a 
biotinylated moiety and a radiolabeled moiety. 

32. The antibody of claim 27, wherein the antibody is humanized. 

33. A method for detecting the presence of cancer in a subject, comprising the 

steps of: 

(a) contacting a biological sample obtained from a subject with at least 
two oligonucleotide primers, each primer consisting of 10 to 200 
contiguous nucleotides of SEQ ID NO: 5 or the complement therof, in a 
reverse transcriptase polymerase chain reaction; and 

(b) detecting in the sample a polynucleotide sequence that amplifies in 
the presence of said oligonucleotide primers, wherein the presence of an 
amplified polynucleotide sequence indicates the presence of cancer in 
the subject. 

34. The method of claim 33, wherein the cancer is breast cancer. 

35. The method of claim 34, wherein the breast cancer is estrogen dependent 
breast cancer. 

36. The method of claim 33, wherein the primers are forward primer (5'- 
CAGAGCCTGT-3 ') (SEQ ID NO: 6) and reverse primer (5'- 
CTCTGGGACA-3 ') (SEQ ID NO: 7). 

37. A polynucleotide probe comprising a polynucleotide selected from the group 
consisting of at least 25 contiguous nucleotides of SEQ ID NO: 3, at least 25 
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contiguous nucleotides of SEQ ID NO: 4, at least 25 contiguous nucleotides 
of SEQ ID NO: 5 and at least 25 contiguous nucleotides of SEQ ID NO: 9. 

38. A method for detecting the presence of cancer in a subject, comprising the 
steps of: 

(a) contacting a biological sample obtained from a subject with the probe 
of claim 37 under conditions that allow the probe to selectively bind a 
BRHF1 nucleic acid; and 

(b) detecting the presence of a BRHF1 nucleic acid, whereby the 
presence of a BRHF1 nucleic acid indicates the presence of cancer in 
the subject. 



39. The method of claim 38, wherein the cancer is breast cancer. 



40. The method of claim 39, wherein the breast cancer is estrogen dependent 
breast cancer. 

41 . The method of claim 38, further comprising: 

d) measuring the amount of BRHF1 nucleic acid in the sample and 
correlating this amount with a particular stage of cancer. 

42. A method of detecting the presence of cancer in a subject comprising: 

a) contacting a sample from the subject with an antibody to a BRHF1 
polypeptide; and 

b) detecting the antibody bound to the BRHF1 polypeptide in the 
sample, wherein binding of BRHF1 polypeptide to the antibody indicates 
the presence of a BRHF1 polypeptide in the sample, the presence of a 
BRHF1 polypeptide indicating the presence of cancer in the subject. 



52 



ATTORNEY DOCKET NO. 21085.0064U1 



43. The method of claim 42, wherein the cancer is breast cancer. 

44. The method of claim 43, wherein the breast cancer is estrogen dependent 
breast cancer. 

45. The method of claim 42 further comprising: 

d) measuring the amount of BRHF1 polypeptide in the sample and 
correlating this amount with a particular stage of cancer. 

46. A method of reducing BRHF1 expression in a cell comprising administering 
to the cell an antisense oligonucleotide that specifically binds to mRNA 
transcribed from the BRHF1 gene under conditions that allow hybridization, 
wherein the BRHF1 mRNA comprises a nucleotide sequence selected from 
the group consisting of SEQ ID NO: 3, SEQ ID NO 4 and SEQ ID NO: 9 
and wherein hybridization of the antisense oligonucleotide with the BRHF1 
mRNA reduces BRHF1 expression. 

47. A method of reducing BRHF1 expression comprising administering to a cell 
a ribozyme that specifically binds to mRNA transcribed from the BRHF1 
gene, the ribozyme binding reducing BRHF1 expression. 

48. A method of reducing BRHF1 expression comprising administering to a cell 
an siRNA that is complementary to at least a portion of the coding sequence 
of BRHF1, under conditions that allow hybridization of the siRNA with the 
BRHF1 coding sequence, wherein the BRHF1 coding sequence comprises a 
nucleotide sequence selected from the group consisting of SEQ ID NO: 3, 
SEQ ID NO: 4 and SEQ ID NO: 9, and wherein the binding of the siRNA to 
the BRHF1 coding sequence reduces BRHF1 expression. 

49. The method of claim 46, 47 or 48, wherein the cell is in a subject. 

50. A method of identifying a compound that reduces BRHF1 expression, 
comprising administering a test compound to a cell containing a BRHF1 
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gene and detecting the level of the BRHFlgene product produced, a 
decrease in the gene product as compared to a control cell indicating the 
compound reduces BRHF1 expression. 

5 1 . A method of identifying a compound that reduces BRHF1 expression, 
comprising administering a test compound to a cell containing a nucleic acid 
encoding the polypeptide of claim 1 and detecting the level of the 
BRHFlgene product produced, a decrease in the gene product indicating a 
compound that reduces BRHF1 expression. 

52. A method of identifying a compound that reduces BRHF1 expression in the 
presence of estrogen, comprising administering a test compound and 
estrogen to a cell containing a BRHF1 gene and detecting the level of the 
BRHFlgene product produced, a decrease in the gene product as compared 
to a control cell indicating a compound that reduces BRHF1 expression in 
the presence of estrogen. 

53. A method of identifying a compound that reduces BRHF1 expression in the 
presence of estrogen, comprising administering a test compound and 
estrogen to a cell containing a nucleic acid encoding the polypeptide of 
claim 1 and detecting the level of the BRHFlgene product produced, a 
decrease in the gene product indicating a compound that reduces BRHF1 
expression in the presence of estrogen. 
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ABSTRACT 



This invention relates generally to the discovery of a novel gene, designated 
BHRF1. The invention further relates to the detection of the BHRF1 gene, BHRF1 
expression and/or the presence of the BHRF1 protein as potential markers for the 
early diagnosis of cancer. 
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TCA GGA AAC AAC 

C W W D 
TGT TGG TGG GAC 

L R D L 
CTC AGG GAT CTA 

I Y P N 
ATA TAC CCA AAT 

F I A A 
TTT ATT GCG GCA 

T M I D 
ACA ATG ATA GAC 

A A I K 
GCA GCC ATA AAA 

E T I I 
GAA ACC ATC ATT 

F S L I 
TTC TCA CTC ATA 



GCA ACA AAA GAC 

TAKE 
ACA GCA AAA GAA 

I F A T 
ATT TTT GCA ACC 

L K Q I 
CTC AAA' CAA ATT 

D M N R 
GAC ATG AAC AGA 

K K C S 
AAG' 'AAA TGC TCA 

R Y K L 
7\/-i7\ —nnr put PTC 

R C W R 
AGG TGC TGG AGA 

C K L V 
TGT AAA CTA GTT 

E L E I 
GAA CTA GAA ATA 

E Y K S 
GAG TAT AAA TCA 

L F T I 
CTA TTC ACA ATA 

W I K K 
TGG ATT AAG AAA 

N D E F 

AAT GAT GAG TTC 

L S K L. 
CTC AGT AAA CTA 

G G N * 
GGT GGG AAT TGA 



AAA ATT 3 02 4 

T T 1026 

ACT ACC 3078 

Y S 1044 
TAC TCA 313 2 

Y K 1062 
TAC AAG 31B6 

H F 1080 

CAC TTC 3240 

S S 1098 
TCA TCA - 3 2 94 

T P 1116 

ACA CCA 3 34 8 

G C 1134 

GGA TGC 3402 

Q P 1152 

CAA CCA 3456 

P F 1170 

CCA TTT 3 510 

C C 1188 

TGC. TGC 3 5 64 

A K 1206 

GCA AAG 3 618 

M' W 1224 

ATG TGG 3 672 

I S 1242 

ATA TCC 372 6 

S Q 1260 

TCG CAA 3 7 80 

1276 
3828 



FIGURE 6 (cont'd) 
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SEQUENCE LISTING 

<110> Roy, Deodutta 

<12 0> BRHF1 AS A CANCER DIAGNOSTIC MARKER 



<130> 21085. 0064U1 

<160> 14 

<170> FastSEQ for Windows Version 4.0 

<210> 1 

<:211> 105 

<2 12 > PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
Synthetic Construct 

<400> 1 



Met 


Lys 


Gly 


Leu 


Ser 


Pro 


He 


Ala 


Lys 


Gly Arg 


Lys 


Thr 


Ser 


Val 


Ser 


1 








5 










10 








15 




Ala 


Ala 


Val 


Leu 
20 


Val 


Ser 


Thr 


Thr 


He 
25 


Pro He 


Ser 


Ser 


Val 
30 


Trp 


Gly 


Pro 


Leu 


Gin 


He 


Leu 


Gly 


Gin 


Lys 


Arg 


Gly Gin Lys 


Met 


Glu 


Gin 


Ala 






35 










40 








45 








Asn 


His 
50 


Pro 


Val 


Gly 


Leu 


Asp 
55 


He 


Ser 


Val Val 


Tyr 
60 


Lys 


Asp 


Thr 


Leu 


Lys 


Lys 


He 


val 


Gin 


Gin 


Glu 


Thr 


Ser 


Cys Pro 


Phe 


Thr 


His 


Val 


His 


65 










70 








75 










80 


Tyr 


Ala 


Glu 


Gly 


He 


Thr 


Gly Arg 


His 


Thr Ala 


Pro 


Glu 


Asp 


Glu Gly 










85 










90 








95 




Ser 


Leu 


Ala 


Gin 
100 


Lys 


Pro 


Pro 


He 


Arg 
105 















<210> 2 
<211> 270 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
Synthetic Construct 

<400> 2 

Met Asn He Asp Ala Lys He Leu Asn Lys He Leu Ala Asn Gin He 

15 10 15 

Gin Gin His He Lys Lys Leu He His His Asp Gin Val Gly Phe He 

20 25 30 

Pro Gly Met Gin Gly Trp Phe Asn He His Lys Ser He Asn Val He 

35 40 45 

Gin His He Asn Arg Thr Lys Asp Lys Asn His Met He He Ser Val 

50 55 60 

Asp Ala Glu Lys Ala Phe Asp Lys Val Gin Gin His Phe Met Leu Lys 
65 70 75 80 
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Thr Leu Asn Lys Leu Gly lie Asp Gly Thr Tyr Leu Lys lie lie Arg 

85 90 95 

Ala lie Tyr Asp Lys Pro Thr Ala Asn lie lie Leu Asn Gly Leu Lys 

100 105 110 

Leu Glu Ala Phe Pro Leu Lys Thr Gly Thr Arg Gin Gly Cys Pro Leu 

115 120 125 

Ser Leu Leu Leu Phe Asn lie Val Leu Glu Val Leu Ala Arg Ala lie 

130 135 140 

Arg Gin Glu Lys Glu lie Asn Cys He Gin Leu Gly Lys Glu Glu Val 
145 150 155 160 

Lys Leu Pro Leu Phe Ala Asp Asp Met He Val Tyr Leu Glu Asn Pro 

165 170 175 

Val Val Ser Ala Pro Asn Leu Leu Lys Leu He Ser Asn Phe Ser Lys 

180 185 190 

Val Ser Gly Tyr Lys He Asn Val Gin Lys Ser Gin Ala Phe Leu Tyr 

195 200 205 

Thr Asn Asn Arg Gin Thr Glu Ser Gin He Met Ser Glu Leu Pro Phe 

210 215 220 

Thr He Ala Ser Lys Arg He Lys Tyr Leu Gly He Gin Leu Thr Arg 
225 230 235 240 

Asp Val Lys Asp Leu Phe Lys Glu Asn Tyr Lys Pro Leu Leu Asn Glu 

245 250 255 

He Lys Glu Asp Thr Asn Lys Cys Lys Asn He Pro Cys Ser 
260 265 270 



<210> 3 
<211> 315 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence : /note = 
Synthetic Construct 



<400> 3 

atgaagggat tatcgcctat cgccaagggg 
gtgagcacaa ctattccgat cagcagtgtc 
agaggacaga aaatggagca ggccaatcac 
aaggacacct taaaaaagat tgtccaacaa 
tatgctgagg gaatcactgg aaggcacact 
aagcccccaa tcaga 



aggaaaacta gtgtttctgc tgctgtgttg 60 
tggggaccat tgcagattct tgggcaaaag 12 0 
ccagtggggc ttgatatcag tgtggtttac 180 
gaaacaagct gccccttcac ccatgtccac 24 0 
gccccagagg atgaaggttc tctggcccag 3 00 

315 



<210> 4 
<211> 810 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence : /note = 
Synthetic Construct 



<400> 4 

atgaacatcg atgcaaaaat 
aaaaagctta tccaccatga 
atacacaaat caataaatgt 
attatctcag tagatgcaga 
actctcaata aattaggtat 
aaacccacag ccaatatcat 
ggcacaagac agggatgccc 
gccagggcaa tcaggcagga 



cctcaataaa atactggcaa 
tcaagtgggc ttcatccctg 
aatccagcat ataaacagaa 
aaaggccttt gacaaagttc 
tgatgggacg tatctcaaaa 
actgaatggg ctaaaactgg 
tctctcactt ctcctattca 
gaaggaaata aattgtattc 



accaaatcca gcagcacatc 60 
ggatgcaagg ctggttcaac 12 0 
ccaaagacaa aaaccacatg 18 0 
aacaacactt catgctaaaa 240 
taataagagc tatctatgac 300 
aagcattccc tttgaaaact 360 
acatagtgtt ggaagttctg 42 0 
aattaggaaa agaggaagtt 480 



2 



ATTORNEY DOCKET NO. 21085. 0064U1 



aaattgcccc 
ccaaatctcc 
caaaaatcac 
gaactcccat 
gatgtgaagg 
acaaacaaat 



tgtttgcaga 
ttaagctgat 
aagcattctt 
tcacaattgc 
acctcttcaa 
gcaagaacat 



tgacatgatt 
aagcaacttc 
atacaccaat 
ttcaaagaga 
ggagaactac 
tccatgctca 



gtatatctgg 
agcaaagtct 
aacagacaaa 
ataaaatacc 
aaaccactgc 



aaaaccccgt 
caggatacaa 
cagagagcca 
taggaatcca 
tcaacgaaat 



cgtctcagcc 54 0 
aatcaacgtg 600 
aatcatgagt 660 
acttacaagg 72 0 
aaaagaggat 78 0 
810 



<210> 5 

<211> 1263 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence : /note = 
Synthetic Construct 



<400> 5 

datctacaga 

cacctattcc 

aacagaaatt 

taagaatctc 

ctactgggta 

caaagacacc 

tatagcacta 

acaattaaaa 

aataactaaa 

caatgaatcc 

actaataaag 

tatcaccacc 

cgcaaataaa 

aagactaaac 

ggcaataatc 

attctaccag 

agaaaaagag 

gccgggcaga 

tgcaaaaatc 

ccaccatgat 

aataaatgta 

aga 



actctccacc 
aaaattgacc 
ataacaaact 
actcaaagcc 
cataacgaaa 
acataccaga 
aatgcctacc 
gaactagaaa 
atcagagcag 
aggagctggt 
aaaaaaagag 
gatcccacag 
ctagaaaatc 
caggaagaag 
aatagtttac 
aggtacaagg 
ggaatcctcc 
gacacaacca 
ctcaataaaa 
caagtgggct 
atccagcata 



ccaaatcaac 
acatagttgg 
atctctcaga 
gctcaactac 
tgaaggcaga 
atctctggga 
agagaaagca 
agcaagagca 
aactgaagga 
tttttgaaag 
agaagaatca 
aaatacaaac 
tggaagaaat 
ttgaatctct 
caaccaaaaa 
aggaactggt 
ctaactcatt 
aaaaagagaa 
tactggcaaa 
tcatccctgg 
taaacagagc 



agaatataca 
aagtaaagct 
ccacagtgca 
atggaaactg 
aataaagatg 
cgcattcaaa 
ggaaagatcc 
aacacattca 
aatagagaca 
gatcaacaaa 
aatagacaca 
taccatcaga 
ggatacattc 
gaatcgacca 
gagtccagga 
accattcctt 
ttatgagacc 
ttttagacca 
ccgaatccag 
gatgcaaggc 
caaagacaaa 



tttttttcag 
ctcctcagca 
atcaaactag 
aacaacctgc 
ttctttgaaa 
gcagtgtgta 
aaaattgaca 
aaagctagca 
caaaaaaccc 
attgatagac 
ataaaaaatg 
gaatactaca 
ctcgacacat 
ataacaggct 
ccagatggat 
ctgaaactat 
agcatcattc 
atatccttga 
cagcacatca 
tggttcaata 
aaccacatga 



caccacacca 
aatgtaaaag 
aactcaggat 
tcctgaatga 
ccaacgagaa 
gagggaaatt 
ccctaacatc 
gaaggcaaga 
ttcaaaaaat 
cgctagcaag 
ataaagggga 
aacacctcta 
acactctccc 
ctgaaattgt 
tcacagccga 
tccaatcaat 
tgataccaaa 
tgaacattga 
aaaagcttat 
tacgcaaatc 
ttatctcaat 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1263 



<210> 6 

<211> 10 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence : /note = 
Synthetic Construct 

<400> 6 

cagagcctgt 10 



<210> 7 

<211> 10 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
Synthetic Construct 
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<400> 7 
ctctgggaca 

<210> 8 

<211> 375 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
Synthetic Construct 



<400> 8 



Mp f- 


Lys 


m v 

uiy 


Leu 




Drn 
r 1 U 


Tip 

X X c 


Al a 

ril CL 


T 

1 








CT 








rild 


AT a 


Va 1 


JLeu 


V ct X 


OCX. 




X XXX 








2 0 










Jr 1 W 


_ 


m n 

\J 1 11 


J. XC 




uiy 


VJXXX 


xjy o 






35 










40 


As n 


His 


Piro 


Val 


Gly 


Leu. 


Asp 


He 




50 










55 






Lys 


Tl e> 


v ai 


m n 


will 


uxu 


X 1XX 


65 










70 






±yr 


A"l a 


nl n 

OJ. u. 


fZl V 


-1 — LC 


± in. 


\jxy 












85 








OCI 




Al a 
nla 


(31 n 


Lys 


Jr -L \J 


jr 1 LJ 


Tip 

X X c 








100 










■UC LL 


Asn 


T .\/ C! 

uy o 


Tip 
X X C 




Ala 


Asn 


Gin 






115 










12 0 


11C 


n _l o 


U-j cj 


Asp 




V CL 1 




Php 

ir lie 




130 










13 5 




Asn 


lie 


His 


Lys 


Ser 


He 


Asn 


Val 


14 5 










15 0 






A 


Lys 


Asn 


His 


Met 


He 


He 


Ser 










X D 3 








T ,A7C 
l-i i3 


V Ct J. 


Gin 


m n 


Hi s 


Phe 


L'lc L- 


Leu 








180 










Asp 


vj± y 




Tyr 


Leu 


Lys 


Tip 


Tl P 
lie 
















200 


Ala 

nl ct 


Asn 


He 


Tip 
X XC 


T.pn 

XJC LL 


Asn 


fi 1 V 

«xy 


Leu 




210 










215 




rnU -v- 

i nir 


taj.y 




Arg 


cl n 




Cys 


Pro 


225 










230 






val 


Leu 


Glu 


Val 


Leu 


Ala 


Arg 


Ala 










245 








Cys 


He 


Gin 


Leu 


Gly 


Lys 


Glu 


Glu 








260 










Asp 


Met 


He 


Val 


Tyr 


Leu 


Glu 


Asn 






275 










280 


Leu 


Lys 


Leu 


He 


Ser 


Asn 


Phe 


Ser 




290 










295 




Val 


Gin 


Lys 


Ser 


Gin 


Ala 


Phe 


Leu 


305 










310 






Ser 


Gin 


He 


Met 


Ser 


Glu 


Leu 


Pro 










325 








Lys 


Tyr 


Leu 


Gly 


He 


Gin 


Leu 


Thr 








340 










Glu 


Asn 


Tyr 


Lys 


Pro 


Leu 


Leu 


Asn 






355 










360 


Cys 


Lys 


Asn 


He 


Pro 


Cys 


Ser 






370 










375 





Sequence : /note = 



T .wo 
uy o 


uxy 


Arg 


uy 9 


X 11X 


Car 
OCX 


Va 1 

V CL X 


OCX 




10 










15 






Jr*x \J 


He 




Ser 


Val 


xx p 


Gly 


25 










3 0 








uxy 


m n 

L3X11 


T iVQ 

xjy o 


nc l. 


m ii 

UXU 


Gin 


Ala 










45 








Ser 


Val 


Val 


xyx 


Lys 


Asp 


Thr 


Leu 








60 










Ser 


<_y a 


Pro 


Phe 


Thr 


His 


Val 


His 






75 










80 


Rl C5 
XXX S 


X 11X 


Al a 

r-*x CL 


riu 


m ii 

L7X LL 


A en 

AS LJ 


Glu 


Gly 




90 










95 




ruy 


Mpt- 
I'lC L. 


Asn 


Tl f> 

X X c 


Asp 


Ala 


Lys 


Tie 

X X c 


105 










110 






He 


Gin 


Gin 


His 


He 


Lys 


Lys 


Leu 










12 5 








He 


Pro 


Gly 


Met 


Gin 


uxy 


iijy 


Phe 








140 










He 


Gin 


His 


He 


Asn 


Ara 


Thr 


Lys 






155 










160 


Val 


Asp 


Ala 


Glu 


Lys 


Ala 


Phe 


Asp 




170 










175 




Lys 


Thr 


Leu 


Asn 


Lys 


Leu 


Gly 


He 


185 










190 






Arg 


nla 


Tip 
lie 


Tyr 


Asp 


xjy o 


Jr X \J 


T 1 V"l T" 
X XIX 










205 








Lys 


Leu 


Glu 


Ala 


Phe 


Pro 


Leu 


Lys 








220 










Leu 


Ser 


Leu 


Leu 


Leu 


Phe 


Asn 


He 






235 










240 


He 


Arg 


Gin 


Glu 


Lys 


Glu 


He 


Asn 




250 










255 




Val 


Lys 


Leu 


Pro 


Leu 


Phe 


Ala 


Asp 


265 










270 






Pro 


Val 


Val 


Ser 


Ala 


Pro 


Asn 


Leu 










285 








Lys 


Val 


Ser 


Gly Tyr 


Lys 


He 


Asn 








300 










Tyr 


Thr 


Asn 


Asn 


Arg 


Gin 


Thr 


Glu 






315 










320 


Phe 


Thr 


He 


Ala 


Ser 


Lys 


Arg 


He 




330 










335 




Arg 


Asp 


Val 


Lys 


Asp 


Leu 


Phe 


Lys 


345 










350 






Glu 


He 


Lys 


Glu 


Asp 


Thr 


Asn 


Lys 



365 
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<210> 9 
<211> 1128 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
Synthetic Construct 



<400> 9 

atgaagggat 

gtgagcacaa 

agaggacaga 

aaggacacct 

tatgctgagg 

aagcccccaa 

atccagcagc 

caaggctggt 

gacaaaaacc 

cacttcatgc 

agagctatct 

ttccctttga 

gtgttggaag 

ggaaaagagg 

cccgtcgtct 

tacaaaatca 

agccaaatca 

atccaactta 

gaaataaaag 



tatcgcctat 
ctattccgat 
aaatggagca 
taaaaaagat 
gaatcactgg 
tcagaatgaa 
acatcaaaaa 
tcaacataca 
acatgattat 
taaaaactct 
atgacaaacc 
aaactggcac 
ttctggccag 
aagttaaatt 
cagccccaaa 
acgtgcaaaa 
tgagtgaact 
caagggatgt 
aggatacaaa 



cgccaagggg 
cagcagtgtc 
ggccaatcac 
tgtccaacaa 
aaggcacact 
catcgatgca 
gcttatccac 
caaatcaata 
ctcagtagat 
caataaatta 
cacagccaat 
aagacaggga 
ggcaatcagg 
gcccctgttt 
tctccttaag 
atcacaagca 
cccattcaca 
gaaggacctc 
caaatgcaag 



aggaaaacta 
tggggaccat 
ccagtggggc 
gaaacaagct 
gccccagagg 
aaaatcctca 
catgatcaag 
aatgtaatcc 
gcagaaaagg 
ggtattgatg 
atcatactga 
tgccctctct 
caggagaagg 
gcagatgaca 
ctgataagca 
ttcttataca 
attgcttcaa 
ttcaaggaga 
aacattccat 



gtgtttctgc 
tgcagattct 
ttgatatcag 
gccccttcac 
atgaaggttc 
ataaaatact 
tgggcttcat 
agcatataaa 
cctttgacaa 
ggacgtatct 
atgggctaaa 
cacttctcct 
aaataaattg 
tgattgtata 
acttcagcaa 
ccaataacag 
agagaataaa 
actacaaacc 
gctcatag 



tgctgtgttg 
tgggcaaaag 
tgtggtttac 
ccatgtccac 
tctggcccag 
ggcaaaccaa 
ccctgggatg 
cagaaccaaa 
agttcaacaa 
caaaataata 
actggaagca 
attcaacata 
tattcaatta 
tctggaaaac 
agtctcagga 
acaaacagag 
atacctagga 
actgctcaac 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1128 



<210> 10 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note 
Synthetic Construct 



<400> 10 

tgagcacaac tattccgatc 



20 



<210> 11 

<211> 23 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note 
Synthetic Construct 



<400> 11 

aagcaacttc agcaaagtct cag 



23 



<210> 12 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence : /note 
Synthetic Construct 

<400> 12 

aaaccactgc tcaacgaaat a 

<210> 13 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note 
Synthetic Construct 

<400> 13 

aagggattat cgcctatcgc c 

<210> 14 
<211> 10 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note 
Synthetic Construct 

<400> 14 
ccgcatctac 



